Defects in drug metabolism

ABSTRACT

PCT No. PCT/US95/07605 Sec. 371 Date Apr. 7, 1997 Sec. 102(e) Date Apr. 7, 1997 PCT Filed Jun. 16, 1995 PCT Pub. No. WO95/34679 PCT Pub. Date Dec. 21, 1995The invention relates to genetic material, and specifically portions of DNA, for identifying the presence or absence of a mutation in the drug metabolism gene CYP2C9 and CYP2A6. Further, the invention comprises a method for determining such mutations and a kit incorporating the genetic material of the invention for performing the said methods so as to determine the presence or absence of mutations in the drug metabolizing gene CYP2C9 and CYP2A6.

FIELD OF THE INVENTION

The invention relates to genetic material, specifically primers, for use in a method designed to determine the genotype of an individual; and also a kit, including the genetic material of the invention, for performing the method of the invention.

BACKGROUND OF THE INVENTION

It is well known that genetic polymorphisms in drug metabolizing genes give rise to a variety of phenotypes. This information has been used to advantage in the past for developing genetic assays that predict phenotype and thus predict an individual's ability to metabolize a given drug. The information is of particular value in determining the likely side effects and therapeutic failures of various drugs. The availability of this sort of information will result in routine phenotyping being recommended for certain categories of patients.

Drug metabolism is carried out by the cytochrome P450 family of enzymes. For example, the cytochrome P450 isozyme gene, CYP2C9 encodes a high affinity hepatic S!-warfarin 7-hydroxylase which appears to be principally responsible for the metabolic clearance of the most potent enantiomer of warfarin. Similarly, the cytochrome P450 isozyme gene, CYP2A6, encodes a protein that metabolizes nicotine and coumarin and activates the tobacco-specific nitrosamine 4-(methyinitrosamino)-1-(3-pyridyl)-1-butanone) (NNK).

It is of note that the above gene products are also known to metabolize other substrates, for example, the CYP2C9 gene product is also known to metabolize Tolbutamide, Phenytoin, Ibuprofen, Naproxen, Tienilic acid, Diclofenac and Tetrahydrocannabinol.

It follows that genetic polymorphisms or mutations in either of the two aforementioned genes can lead to an impairment in metabolism of at least the aforementioned drugs.

In so far as CYP2C9 is concerned, sequences reported by Yasumori et al (1987 J. Biochem. 102:1075-1082.) and Kimura et al (1987 Nuc. Acids Res. 15:10053-10054) show differences at several positions including a C to T base change that results in a Arginine/Cysteine polymorphism at amino acid 144. This polymorphism has been designated R144C.

In so far as CYP2A6 is concerned, a T to A base change at position 488 of the cDNA sequence described by Yamano et al (1990 Biochemistry 29:1322-1329) results in substitution of Leucine 160 by Histidine. Henceforth this mutant form of the gene will be designated CYP2A6v1.

The variant CYP2A6v1 encodes an enzyme that is unstable and catalytically inactive. It is found in the general population at a frequency of about 1% but does not account for all slow metabolizers of coumarin.

Since the cDNA sequence structure of CYP2C9 and CYP2A6 are known, and since it is also known to perform genetic assays to determine whether a preselected mutation is present within a given gene, it should, in theory, be possible to design assays which specifically determine whether either of the aforementioned mutations are present in each of the respective aforementioned genes.

However, we have found an extraordinarily high degree of exon homology in the cytochrome P450 genes. This has resulted in non-specific binding of assay materials and poor performance of assays. In the instance where primers have been used to hybridize to genetic material, non-specific binding of such primers has taken place, and in the further instance where primers have been used to hybridize to genetic material with a view to performing a polymerase chain reactions we have found that related genes have also been amplified, for example, CYP2A7, CYP2A12 and CYP2C8 have also been amplified.

SUMMARY OF THE INVENTION

The present invention relates to novel variant alleles in cytochrome P450 genes which express enzymes involved in the metabolism of particular drugs and/or chemical carcinogens.

One object of the present invention relates to the discovery of new mutant or variant CYP2A6 alleles wherein the human gene is characterized. A new variant allele has been found which is designated CYP2A6v2. The cDNA and genomic sequence of CYP2A6v2 is provided in the present invention. Another new gene related to CYP2A6 has been discovered and is designated CYP2A13. The cDNA and genomic sequence of CYP2A13 is provided in the present invention.

Another object of the present invention relates to the use of intron sequences to specifically identify CYP2A6 and CYP2C9 variants in a gene specific detection assay.

Another object of the present invention is to use an oligonucleotide probe, specific for regions unique to a particular CYP2 variant to screen for the presence or absence of the variant in a sample.

Yet another object of the invention is to provide genetic material, a method, and a kit which enable genotyping of the CYP2C9 and CYP2A6 gene with a view to providing phenotypic information concerning drug metabolism.

A further object of the present invention provides a method for diagnostically determining the sensitivity of a patient for specific drugs and chemical carcinogens. Such a method is widely applicable in determining the proper dosage of a drug for a patient.

Another object of the present invention provides a method of genotyping CYP2A6 and CYP2C9 and determining whether a mutation has altered the sequence of these genes and hence altered sensitivity to particular drugs and chemical carcinogens.

In accordance with the present invention a method is provided which utilizes the finding that each variant of a CYP2 gene has specific nucleotide differences as compared with the wild-type CYP2 gene. Such nucleotide changes can be utilized in a probe-hybridization assay, which is capable of specifically detecting a chosen variant and not other variants.

The present invention also provides a genotyping method for identifying the presence or absence of a mutation at codon 144 of the coding sequence of CYP2C9, or alternatively, at codon 160 of the coding sequence of CYP2A6, or alternatively, a gene conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 or 8 comprising use of a portion of DNA. Such a mutation is then correlated to the sensitivity of particular drugs and chemical carcinogens.

The present invention further relates to a gene-specific bioassay which is capable of distinguishing between the CYP2 genes and identify the presence or absence of a mutation in CYP2A6 and CYP2C9 genes. Such a bioassay can diagnostically predict the sensitivity of an individual to particular drugs or chemical carcinogens. For example, the CYP2C9 variants identify a sensitivity to a commonly used anti-coagulant drug, warfarin. The CYP2A6 variants identify sensitivity to coumarin, nicotine and nitrosamines. The sensitivity to nicotine may be used to predict a predisposition to tobacco-related diseases, a propensity to smoking and adverse reactions to exposure to nicotine. Further, CYP2A6 genes are associated with the activation of nitrosamines, elevated levels of which have been correlated with many cancers.

The present invention also provides a method of genotyping the CYP2A6 and CYP2C9 genes using allele-specific amplification reaction.

In addition, a highly-specific combination genotyping bioassay has been developed to identify mutations within CYP2A6 and CYP2C9 which are linked to sensitivity to particular drugs and chemical carcinogens. This combination bioassay comprises a gene-specific amplification reaction, an exon-specific amplification reaction and an endonuclease cleavage reaction wherein only one form, either mutant or wild-type is cleaved, producing either a single nucleic acid fragment or multiply nucleic acid fragments depending upon the presence or absence of the mutation. For example, one CYP2C9 variant, R144C, which contains a C₄₇₂ →T mutation can be identified by an AvaII restriction site. CYP2A6 variants can also be identified by their corresponding mutations. CYP2A6v1 which contains a T₄₈₈ →A mutation can be identified by a XcmI restriction site. CYP2A6v2 which contains a T₄₁₅ →A mutation can be identified by a DdeI restriction site.

The present invention also relates to a method for screening patients for drug sensitivity prior to their treatment with that drug, thereby alerting a physician of a drug sensitivity. In addition, the method may be used to screen patients for a predisposition to cancers related to excessive nitrosamine activation, which are associated with mutations within the CYP2A6 gene locus. Further, the method may be used to screen patients for a sensitivity to chemical carcinogens, based upon the genotype of the CYP2A6 and/or CYP2C9 alleles.

One such new allele variant, CYP2A6v2, has 98% nucleotide similarity and 80% amino acid similarity with the wild type CYP2A6, respectively. The present invention relates to the new CYP2A6v2 variant, the cDNA sequence and its genomic sequence wherein the alterations in sequence are within exons 3, 6 and 8, which are attributed to a gene conversion. In addition, another new gene, also involved in drug metabolism has been identified, and has been designated CYP2A13. This gene plays a similar role in drug metabolism as CYP2A6. These new gene sequences or fragments thereof are used as probes in identifying specific CYP2 variants in samples. In additions, fragments of the new genes are used as primers in a genotyping assay.

The invention further provides isolated CYP2Av2 and CYP2A13 cDNAs for use in gene therapy and replacement protocols for individuals who are predisposed to sensitivity to needed drugs or to chemical or environmental carcinogens.

In accordance with an aspect of the present invention, there are provided primary human cells which are genetically engineered with CYP2A6v2 or CYP2A13 DNA (RNA) which encodes a therapeutic agent of interest, and the genetically engineered cells are employed as a therapeutic agent. (The term "therapeutic," as used herein, includes treatment and/or prophylaxis.) Gene expression in an organism in accordance with the practices of this invention is regulated, inhibited and/or controlled by incorporating in or along with the genetic material of the organism non-native DNA which transcribes to produce an RNA which is complementary to and capable of binding or hybridizing to a mRNA produced by a gene located within said organism. Upon binding to or hybridization with the mRNA, the translation of the MRNA is prevented. Consequently, the protein coded for by the mRNA is not produced. In the instance where the mRNA translated product, e.g. protein, is vital to the growth of the organism or cellular material, the organism is so transformed or altered such that it becomes, at least, disabled.

Accordingly, in the practices of this invention from a genetic point of view as evidenced by gene expression, new organisms are readily produced. Further, the practices of this invention provide a powerful tool or technique for altering gene expression or organisms through gene therapy. The practices of this invention may cause the organisms to be disabled or incapable of functioning normally or may impart special properties thereto. The DNA of CYP2A6v2 or CYP2A13 employed in the practices of this invention can be incorporated into the treated or effected organisms by direct introduction into the nucleus of a eukaryotic organism or by way of a plasmid or suitable vector containing the special DNA of this invention in the case of a procaryotic organism.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention are described by way of example only with reference to the accompanying figures wherein:

FIG. 1 Shows the sequence of exon 2, intron 2 and exon 3 of CYP2C8 and CYP2C9, cDNA sequences (from 4) are shown at the top of the page together with sequences from 6 genomic clones encompassing exon 2, intron 2 and exon 3 of CYP2C8 and CYP2C9. The position of the polymorphism at codon 144 of CYP2C9 and the PCR primers are indicated.

FIG. 2 Shows the sequence of intron 2, exon 3 and intron 3 of CYP2A6, CYP2A7 and CYP2A12. The position of the polymorphism at codon 160 in CYP2A6 and the PCR primers are indicated.

FIG. 3 Shows the detection of CYP2C9 Arg₁₄₄ Cys polymorphism by PCR. Following amplification, samples were digested with AvaII and analyzed on a 1.8% agarose gel. Lane I and lanes 3 to 6 show homozygous wild-type subjects, lane 2 a heterozygous individual and lane 7 undigested PCR product.

FIG. 4 Shows detection of CYP2A6 Leu₁₆₀. His polymorphism by PCR. Two parallel PCR reactions were carried out and the products analyzed on a 1% agarose gel. Lanes 1, 3, 5 and 7 show the results of the wild-type specific assay and lanes 2, 4, 6 and 8 the results of the variant-specific assay for the same four subjects. Subjects I and 2 (lanes 1-4) are homozygous wild-type, subject 3 (lanes 5 and 6) heterozygous and subject 4 (lanes 7 and 8) homozygous for the mutation.

FIG. 5 Shows distribution of the weekly maintenance doses for warfarin in patients (n=57) homozygous for the CYP2C9 wild-type allele (open bars) and heterozygous (n=37) for the R144C mutant allele (solid bars). Arrows show the median weekly dose requirement of warfarin for each genotype.

FIG. 6 Represents 7-hydroxylation of coumarin (%) in a family genotyped for the CYP2A6 and CYP2A6v1 alleles, showing a subject homozygous for the CYP2A6v1 allele who is deficient in coumarin 7-hydroxylation.

FIG. 7 Shows the difference between the genomic and cDNA sequences for the CYP2A6 gene.

FIGS. 8a and b. Shows the conversion event which leads to the CYP2A6v2 allele.

FIGS. 9a through 9c. Shows the detection of CYP2A6v2 by PCR. (FIG. 9A) gene-specific amplification by PCR of the CYP2A6 gene using E3F and E3R. Lanes 1 to 4 show the 7.8 Kb band obtained from several representative human genomic DNA templates, lane 5 correspond to a negative control in the absence of template and lane 6 contains 1 Kb DNA ladder (GIBCO BRL) as six markers. (FIG. 9B) Exon-specific PCR amplification of exon 3 from the 7.8 Kb long-PCR product and restriction endonuclease pattern obtained after digestion with XcmI (left) and DdeI (right) to detect the CYP2A6v1 and CYP2A6v2 alleles, respectively. The genotypes shown correspond to: wild type (+/+), heterozygous (+/-) and homozygous (-/-) subjects. (C) The genotyping strategy which has been developed. Exons are indicated by boxes. The position of the corresponding primer pairs are indicated by horizontal arrows. XcmI and DdeI restriction sites generate digestion patterns for the different alleles having fragment sizes as shown.

FIG. 10 Schematic diagram depicting methodology underlying a CYP2C9 genotyping assay.

FIG. 11. CYP2A6v2 cDNA sequence (SEQ ID NO: 1).

FIG. 12. CYP2A6v2 genomic DNA sequence having 7216 base pairs (SEQ ID NO: 3)

FIG. 13. CYP 2A13 CDNA sequence (SEQ ID NO: 2).

FIG. 14. CYP2A13 genomic DNA sequence having 8779 base pairs (SEQ ID NO: 4).

FIG. 15. Agarose minigel electrophoresis of PCR products. The CYP2C9 wild-type allele (Arg-144) and R144C respectively, Lanes marked "+/+" and "+/-" contain homozygous wild types and heterozygotes respectively. the right-hand lane contains a 100 bp ladder.

DETAILED DESCRIPTION OF THE INVENTION

The cytochrome P450 isozyme gene, CYP2C9 encodes a high affinity hepatic S!-warfarin 7-hydroxylase which appears to be principally responsible for the metabolic clearance of the most potent enantiomer of warfarin along with metabolizing a number of other drugs and chemical carcinogens. Similarly, the cytochrome P450 isozyme gene, CYP2A6, encodes a protein that metabolizes nicotine, coumarin and a host of other drugs and chemical carcinogens CYP2A6 also activates the tobacco-specific nitrosamine 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone (herein referred to as "NNK"). Many cancers have been associated with activation and/or accumulation of nitrosamines. The present invention allows detection of a predisposition to such cancers.

It is of note that the above gene products are also known to metabolize other substrates. For example, the CYP2C9 gene product is also known to metabolize Tolbutamide, Phenytoin, Ibuprofen, Imipramine, Naproxen, Tienilic acid, Diclofenac and Tetrahydrocannabinol and hence can also be used to detect sensitivities to these drugs. A list of CYP2C9 drug substrates has been documented and is incorporated herein by reference (Gonzalez & Idle 1994 Clin. Pharmacokinet 26:59-70). Hence, the present invention can be used to screen for sensitivities to these drugs.

In addition, CYP2C9 has been associated with the metabolism of chemical carcinogens, such as polycyclic aromatic hydrocarbons. For example, the most ubiquitous environmental carcinogen, benz- a!-pyrene is metabolized by CYP2C9. Benz- a!-pyrene is found in tobacco, barbecued meats, car exhaust and generally, in polluted air. This compound, as it accumulates in the body becomes a potent DNA intercalating agent, ultimately resulting in cell transformation and the formation of tumors. The present invention provides a diagnostic method of screening individuals for their ability to metabolize and hence inactivate benz- a!-pyrene. For example, a homozygote wild-type CYP2C9 individual would be better able to tolerate high levels of benz- a!-pyrene than a heterozygote of the CYP2C9 allele.

Similarly, the CYP2A6 allele is associated with drug sensitivity and carcinogen metabolism. Coumarin sensitivity is directly related to the presence of a variant CYP2A6 allele, such as CYP2A6v1, CYP2A6v2 and also CYP2A13. Coumarin is a drug used in treatment of neoplastic diseases, such as lymphomas. (See Martindale: The Extra Pharmacopoeia 1993 Ed. Reynolds, J. E. F., The Pharmaceutical Press, London, p. 1358). Its suggested dosage is very high. Therefore, the present invention is useful in determining a patient's sensitivity to the drug in order to prescribe a proper dosage and avoid toxicity.

Another drug, Thiotepa™, is used in the treatment of a variety of neoplastic diseases, such as in treating women with breast cancer and children with brain tumors. Thiotepa is metabolized by CYP2A6 into Tepa, which is an intermediate more therapeutically potent than Thiotepa. Therefore, if a patient has a very active CYP2A6 enzyme, it is likely the patient will require lower doses of Thiotepa to provide a therapeutically effective amount. As one can see, the dosage provided to a patient is dependent upon the rate a patient is capable of metabolizing activating the drug. The present invention has identified variant alleles whose enzymatic activity is compromised. In addition, the present invention provides a simple method of genotyping patients for Thiotepa drug sensitivity. With information concerning patient sensitivity to such drugs, the proper dosage can be provided, hence maximizing drug efficiency and minimizing drug toxicity.

Further, CYP2A6 has been associated with nicotine metabolism. In addition to being an active ingredient in tobacco, nicotine also has several clinical uses. Nicotine is used clinically to treat various neurological disorders, such as Parkinson's disease and Alzheimer's disease. In addition, nicotine is used to treat tobacco addiction. In all of these situations, it is important to know a patient's sensitivity to nicotine, since extremely sensitive patients will become violently ill upon administration of nicotine. Therefore the present invention provides a method of identifying nicotine-sensitive patients by genotyping a patient's CYP2A6 allele. The present invention also provides a convenient method for determining an individual's general predisposition to using tobacco based upon their sensitivity to nicotine.

In addition, CYP2A6 is involved in activating nitrosamines, thereby producing the potent carcinogen NNK. Increased levels of NNK have been associated with a variety of cancers, including but not limited to lung cancer, nasal-pharynx cancers, throat cancers and colon cancers. In general, elevated levels of CYP2A6 has been associated with cancers associated with exposure to nitrosamines. The present invention may detect a patient's predisposition to such cancers. The presence of a CYP2A6 gene or a variant thereof will affect the likelihood that procarcinogens present in tobacco smoke will be activated into carcinogenic nitrosamines and nitrosamine-derivatives and therefore result in the development of a cancer.

It follows that genetic polymorphisms or mutations in either of the two aforementioned genes can lead to an impairment in metabolism of at least the aforementioned drugs and chemical carcinogens.

The present invention relates to the identification of the absence or presence of mutations in CYP2C9 and CYP2A6 and thus predict the phenotype of an individual and so predict whether and how an individual is likely to metabolize particular drugs and chemical carcinogens. For instance, the R144C mutation arising from a C₄₇₂ →T base substitution in the CYP2C9 gene results in a reduction in warfarin metabolism. This implies that patients with this mutation receiving warfarin require a lower dose to maintain an anticoagulation target than those patients who do not have the mutation and are also receiving warfarin. Conversely, homozygous wild-types require higher doses in order to maintain an anticoagulation target.

"Mutation", as the term is used herein denotes an allelic variation of a known sequence, which alters the expressed gene product's activity. Such a variation need not completely inactivate the gene product's activity but merely alter it.

Similarly, one mutation within CYP2A6v1 arising from a T₄₈₈ →A base change results in substitution of Leucine 160 by Histidine. Another CYP2A6 variant, CYP2A6v2, has been identified which differs from CYP2A6 in the regions of exons 3, 6 and 8. One particular mutation in CYP2A6v2, T₄₁₅ →A mutation is useful in the assay of the present invention. These substitutions are very useful in detecting predispositions to cancers associated with tobacco and activation of nitrosamines. The normal CYP2A6 enzyme functions in the metabolism of nicotine, one of the carcinogenic compounds in tobacco.

In addition, the present invention relates to the identification of a new variant of CYP2A6 designated CYP2A6v2. The variations of CYP2A6v2 from CYP2A6 bear sequence relatedness with the corresponding exons of the CYP2A7 gene, suggesting a recent gene conversion. The cDNA and genomic sequence for this gene is provided in the present invention. Hence, at least three different allelic variants of CYP2A6 exist and are illustrated in FIG. 8. These allelic variants include CYP2A6, CYP2A6v1 and CYP2A6v2.

Further, the present invention relates to a new CYP2A gene, designated CYP2A13. This gene produces an inactive form of CYP2A6, however variants at particular positions, including amino acid positions 117, 209 and 365 produce an enzyme which may alter the enzyme's activity and hence affect drug sensitivity. These mutations in CYP2A6 are likely to result in a deficiency or impaired activity of one of the enzymes responsible, for example, for metabolizing drugs, nicotine and nitrosamines.

CYP2A13 is considered a new cytochrome P450 gene. However, since the CYP2A13 gene product has a similar function as the CYP2A6, it is discussed herein as a variant of CYP2A6. That is, assays using the specific mutated amino acid positions 117, 209 and 365 of CYP2A13 and detecting variations at those positions are indicative of CYP2A6-like variant functions.

In one embodiment, the CYP2A6v2 or CYP2A13 proteins or functional portions thereof are expressed as recombinant genes in a cell, so that the cells may be transplanted into an individual in need of gene therapy due to the predisposition to a carcinogen-associated cancer or a sensitivity to a drug. To provide gene therapy to an individual, a genetic sequence which encodes for all or part of the CYP2A6v2 or CYP2A13 ligands are inserted into vectors and introduced into host cells. Examples of vectors that may be used in gene therapy include, but are not limited to, defective retroviral, adenoviral, or other viral vectors (see, e.g., Mulligan, R. C., 1993, Science, 260:926-932). The means by which the vector carrying the gene may be introduced into the cell includes, but is not limited to, microinjection, electroporation, transduction, or transfection using DEA-dextran, lipofection, calcium phosphate or other procedures known to the skilled routineer (see, e.g., Sambrook et. al. (Eds.), 1989, In "Molecular Cloning. A Laboratory Manual", Cold Spring Harbor Press, Plainview, N.Y.). Examples of cells into which the vector carrying the gene may be introduced include, but are not limited to, continuous culture cells, such as COS, NIH/3T3, and primary or culture cells of the relevant tissue type.

More specifically, there is provided a method of enhancing the therapeutic effects of blood cells, that are infused in a patient, comprising: (i) inserting into the blood cells of a patient a DNA (RNA) segment encoding CYP2A6v2 or CYP2A13 gene product that enhances the therapeutic effects of the blood cells; and (ii) introducing cells resulting from step (i) into the patient under conditions such that the cells resulting from step (i) "target" to a tissue site. In the alternative, as previously described the cells are not "targeted" and functions as a systemic therapeutic. The genes are inserted in such a manner that the patient's transformed blood cell will produce the agent in the patient's body. In the case of antigen-specific blood cells which are specific for an antigen present at the tissue site, the specificity of the blood cells for the antigen is not lost when the cell produces the product.

Alternatively, as hereinabove indicated, CYP2A6v2 or CYP2A13 DNA (RNA) may be inserted into the blood cells of a patient, in vivo, by administering such DNA (RNA) in a vehicle which targets such blood cells.

Further details regarding methods of gene therapy are provided in Anderson et al., U.S. Pat. No. 5,399,343 which is herewith incorporated herein by reference.

In another embodiment, antisense CYP2A6v2 or CYP2A13 DNA or RNA may be used to control the expression of CYP2 gene. For example, antisense therapy may be used to control CYP2A6's ability to activate dangerous nitrosamines by curbing its expression. Methods of producing such antisense molecules are described in U.S. Pat. No. 5,190,931, which is incorporated herein by reference.

Developing a genotyping assay, which could distinguish the CYP2 genes of interest from other cytochrome P450 genes required careful engineering since these genes have a high degree of sequence homology. To overcome this problem, one embodiment of the present invention has elucidated the genomic sequence structure of CYP2C9 and CYP2A6 with a view to making, in part, intron specific primers. That is to say primers which, in part, hybridize to at least one intron, preferably an intron adjacent to an exon including the mutation of interest, in the gene to be examined. Since there is less homology between the introns of cytochrome P450 genes, it has been found that using intron specific primers, gene specific assay can be undertaken. The present invention has a further advantage of using intron specific primers in so far as the use of such primers facilitates the manufacture of an optimum length of DNA which in turn facilitates the specificity of the instant bioassay.

A "genotyping" assay as the term is used herein refers to any diagnostic or predictive test to detect the presence or absence of allelic variants of a known gene sequence at a specified gene locus. Two gene loci are of particular interest in the present invention, CYP2A6 and CYP2C9.

Further, the present invention relates to differences between the genomic DNA sequence structure and the cDNA sequence structure, as illustrated in FIG. 7. As a result, primers directed at the genomic sequence structure have been developed which are more reliable.

Several methods are provided for identifying the presence or absence of a mutation at codon 144 of the coding sequence of CYP2C9, or alternatively, at codon 160 of the coding sequence of CYP2A6, or alternatively, a gene conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 or 8 comprising a DNA encompassing the region of a CYP2 gene unique to that variant.

One such method relates to an assay which contemplates the use of one specific primer which specifically encompasses the region containing the mutation, and a second primer which is complementary to another portion of the gene. The second primer sequence chosen is based upon the CYP2A6, CYP2C9 or CYP2A13 sequences as set forth in FIGS. 12, 1 and 14, respectively, depending upon the preferred size of the amplification product. One skilled in the art will know how to select second primer based on the region of gene chosen for amplification. These primers need not be identical to a given sequence but must be sufficiently complementary to hybridize to the target region in a specific manner. In short, the primers are preferably at least substantially homologous to the nucleic acid sequence provided.

Nucleic acid sequences includes, but is not limited to, DNA, RNA or cDNA. Nucleic acid sequence as used herein refers to an isolated nucleic acid sequence. Substantially homologous as used herein refers to substantial correspondence between the nucleic acid primer sequence of as described herein and that of any other nucleic acid sequence. Substantially homologous means about 50-100% homologous homology, preferably by about 70-100% homology, and most preferably about 90-100% homology between the particular sequence discussed and that of any other nucleic acid sequence.

In the instant application, the term "primer" is further used to designate a molecule comprising at least three nucleotides, the exact length being determined by the requisite amount of DNA needed, under given reaction conditions, to bind to or interact with a test sample so as to identify the presence or absence of either of said mutations. Preferably, the primer is usually between 15 and ideally about 20 to 50 oligonucleotides in length.

The primer is selected, or adapted, to be substantially complementary to a part of DNA which is adjacent to the region of at least one of the aforementioned mutations. Thus such a primer is able is hybridize with a part of DNA that contains a region in which the mutation of interest may be found. Although the primer may not reflect the exact sequence of the region in which the mutation is thought to occur, the more closely the primer is to this sequence, then the better the binding will be. Ideally, the more closely the sequence of the 3' end of the primer is to said region the better the binding or interaction will be.

An alternative method for using the sequence unique to a variant for detection relates to use of an oligonucleotide probe for specifically detecting the presence or absence of a CYP2 variant gene in a sample. this method comprises the steps of contacting the sample with a nucleic acid probe, allowing hybridization, forming a probe: CYP2 variant complex; washing excess probe from probe: CYP2 variant complex; and detecting probe: CYP2 variant complex, wherein a positive signal is an indication of the presence of the CYP2 variant in the sample.

The hybridization of the probe to sample nucleic acids can be carried out by any of the methods commonly used in the art. Such methods include but are not limited to, Dot blot, Colony hybridization, Southern blot, solution hybridization and in situ hybridization.

Washing the excess probe from the probe: CYP2 variant DNA can be accomplished by many well-known methods. Simply rinsing the complex with excess buffer will facilitate removal of excess probe. Alternatively, washing may entail separating the probe: CYP2 variant complex from excess probe. Many methods are known to one skilled in the art and include but are not limited to centrifugation, filtration and magnetic force.

According to the present invention there is provided a portion of DNA suitable for use as a primer in a method for identifying the presence or absence of a mutation either at codon 144 of the coding sequence of the gene CYP2C9, or alternatively, at least one gene conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 or 8, or alternatively, at codon 160 of the coding sequence of the gene CYP2A6; comprising a DNA which is adapted to hybridize to at least one intron of at least one of said genes.

In one embodiment, the method comprises the use of at least one restriction endonuclease to digest DNA from individuals to be tested. In this instance, DNA from individuals positive for the wild-type form of CYP2C9 provide a digest with a restriction endonuclease, such as AvaII results in production of two fragments, a first fragment including 270 base pairs and a second fragment including 50 base pairs. In contrast, individuals having the aforementioned mutation in CYP2C9 present a single fragment of 320 base pairs only. This is due to a loss of the AvaII site. The CYP2A6 gene variants can also be distinguished by the occurrence of specific restriction endonuclease sites. The CYP2A6v1 variant, which is a T₄₈₈ →A mutation in exon 3 can be identified by a variant-specific XcmI restriction site. The CYP2A6v2 variant, which contains a C₄₁₅ →A mutation within exon 3 can be identified by a variant-specific DdeI restriction site. The wild-type CYP2A6 gene does not contain either an XcmI or DdeI site. The results of such restriction endonuclease digestions are illustrated in FIG. 9.

It may be necessary to amplify the DNA prior to digestion. Such may be the case when the DNA of interest is present in minute quantities in a sample. In such circumstances, amplification of DNA to be tested is undertaken before digesting the DNA as described above. This provides for a greater quantity of materials. Amplification is performed using any conventional technique, such as by a PCR reaction. Many other techniques for amplification can be used in producing sufficient DNA for detections. Such amplification techniques are well-known to the skilled artisan and include, but are not limited to polymerase chain reaction (PCR), PCR in situ, ligase amplification reaction (LAR), ligase hybridization, QB bacteriophage replicase, transcription-based amplification system (TAS), genomic amplification with transcript sequencing (GAWTS) and nucleic acid sequence-based amplification (NASBA). A general review of these methods is available in Landegren, et al., Science 242:229-237 (1988) and Lewis, R., Genetic Engineering News 10:1, 54-55 (1990), which is incorporated herein by reference.

One embodiment of the present invention uses oligonucleotide primers in an amplification and detection assay. A basic description of nucleic acid amplification is described in Mullis, U.S. Pat. No. 4,683,202, which is incorporated herein by reference. The amplification reaction uses a template nucleic acid contained in a sample, two primer sequences and inducing agents. The extension product of one primer when hybridized to the second primer becomes a template for the production of a complementary extension product and vice versa, and the process is repeated as often as is necessary to produce a detectable amount of the sequence.

The inducing agent may be any compound or system which will function to accomplish the synthesis of primer extension products, including enzymes. Suitable enzymes for this purpose include, for example, E.coli lDNA polymerase I, thermostable Taq DNA polymerase, Klenow fragment of E.coli DNA polymerase I, T4 DNA polymerase, other available DNA polymerases, reverse transcriptase and other enzymes which will facilitate combination of the nucleotides in the proper manner to form amplification products.

A sample being screened for the presence or absence of a mutation in CYP2A6 and/or CYP2C9 genes can be tested with the instant invention. The nucleic acid material can be in purified or nonpurified form, provided the sample contains the CYP2A6 and/or CYP2C9 genes. The sample may be derived from any tissue or bodily fluid, wherein the patient's DNA can be found. A clinically practical type of sample is a blood specimen which contains patient DNA and can conveniently be genotyped in the bioassay of the present invention.

The "primers", as the term is used in the present invention refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions wherein synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, i.e. in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH. The primers are preferably single stranded for maximum efficiency in amplification, but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare amplification products. Preferably, the primers are oligodeoxyribonucleotides. The primers must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths of the primers will depend on many factors, including temperature, source of primer and use of the method. For diagnostic methods, the primers typically contain at least 10 or more nucleotides. The oligonucleotide primers may be prepared using any suitable method, such as, for example, the phosphotriester and phosphodiester methods (Narang, S. A., et al., Meth. Enzymol. 68:90 (1979); Brown E. L., et al., Meth. Enzymol., 68:109 (1979)) or automated embodiments thereof. In one such automated embodiment diethylphosphoramidites are used as starting materials and may be synthesized as described by Beaucage et al., Tetrahedron Letters 22:1859-1962 (1981). One method for synthesizing oligonucleotides on a modified solid support is described in U.S. Pat. No. 4,458,066. It is also possible to use a primer which has been isolated from a biological source (such as a restriction endonuclease digest).

In a genotyping bioassay of the present invention, one embodiment comprises a gene-specific amplification reaction, an exon-specific amplification reaction and a restriction endonuclease reaction. In such a reaction a suitable polynucleotide polymerase is used in the amplification reaction, many of which have already been described in the art. In addition, any appropriate restriction endonuclease which is designed to digest the DNA and so provide information concerning genotype may be used.

It may further be necessary to provide a label on the nucleic acid for detection. The nucleic acid can be DNA or RNA and made detectable by any of the many labeling techniques readily available and known to the skilled artisan. Such methods include, but are not limited to, radio-labelling, digoxygenin-labeling, and biotin-labeling. A well-known method of labeling DNA is ³² P using DNA polymerase, Klenow enzyme or polynucleotide kinase. In addition, there are known non-radioactive techniques for signal amplification including methods for attaching chemical moieties to pyrimidine and purine rings (Dale, R. N. K. et al. 1973 Proc. Natl. Acad. Sci. USA, 70:2238-2242; Heck, R. F. 1968 S. Am. Chem. Soc., 90:5518-5523), methods which allow detection by chemiluminescence (Barton, S. K. et al. 1992 J. Am. Chem. Soc., 114:8736-8740) and methods utilizing biotinylated nucleic acid probes (Johnson, T. K. et al. 1983 Anal. Biochem., 133:125-131; Erickson, P. F. et al. 1982 J. of Immunology Methods, 51:241-249; Matthaei, F. S. et al 1986 Anal. Biochem., 157:123-128) and methods which allow detection by fluorescence using commercially available products. Non-radioactive labelling kits are also commercially available. Such a label can readily be incorporated into the nucleic acid during an amplification step. In the absence of an amplification step, a target nucleic acid can readily be chemically or enzymatically modified to carry a label. Additionally, it may be preferable to provide a labeled primer which may serve to incorporate a label into the nucleic acid target. Probes, as may be used in an embodiment of the invention may also be chemically or enzymatically labeled as described above.

In a preferred embodiment of the invention said DNA primer hybridizes to an intron adjacent said position of said mutation. Preferably said DNA is a primer with the 3'-end specific for the gene of interest. Preferably further still said DNA is single stranded. Preferably further still, in so far as the CYP2C9 mutation is concerned, said primers are as follows:

HF18: position 8 of intron 2 onwards of genomic sequence in forward orientation comprises 5' TGCAAGTGCCTGTTTCAGCA 3' (SEQ ID NO: 5)

HF2R: position 505 onwards of cDNA sequence in reverse orientation comprises 5' AGCCTTGGTTTTTCTCAACTC 3' (SEQ ID NO: 6).

It is of note that both these primers are designed to be specific for CYP2C9 and so do not amplify related genes such as CYP2C8, which notably also has an Arginine₁₄₄ present.

Preferably, in so far as CYP2A6 is concerned, three primers J51, J61 and B are used in two parallel allele-specific PCR reactions. These primers are as follows:

J51 comprises 5' GGCTTCCTCATCGACGCACT 3' (SEQ ID NO: 7) (forward strand from position 479 of cDNA sequence described as hIIA3 (Yamano, et al. 1990 Biochem 29:1322-29)).

J61 comprises 5' GGCTTCCTCATCGACGCACA 3' (SEQ ID NO: 8) (forward strand from position 479 of CDNA sequence described as hIIA3v (Yamano, et al. 1990 Biochem 29:1322-29)).

Both J51 and J61 contain a substitution at position 18 of A for C to give improved specificity as suggested by Newton et al (1989 Nuc. Acids Res. 17:2503-2516).

Primer B comprises 5' AATTCCAGGAGGCAGGGCCT 3' (SEQ ID NO: 9) (reverse orientation from position 125 of intron 3 of CYP2A6 (onwards). Designed so that only CYP2A6 and not CYP2A7 or CYP2A12 are amplified.

One method of genotyping CYP2A6 provides an allele-specific amplification reaction method is used. In this instance, DNA which is adapted to specifically hybridize to the wild-type or the mutant type of the gene is incubated with test DNA under reaction conditions and the resultant products are analyzed by electrophoresis and then visualized by staining with ethidium bromide. Individuals who are homozygous for the wild-type allele produce a reaction product with primer J51 only. Similarly, individuals who are homozygous for the mutation produce a reaction product with primer J61 only. Those individuals who are heterozygous produce a reaction product with both J51 and J61.

Alternatively, another method for genotyping CYP2A6 is provided in a specific amplification bioassay, which is achieved with primers F4 and R4 as follows:

The F4 primer (forward) comprises

5' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCAGGATT CATGGTGGGGCA 3' (SEQ ID NO: 10), wherein a preferred fragment thereof further comprises

5' CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC 3' (SEQ ID NO: 11).

The R4 primer (reverse) comprises

5' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTCTTAGC TGCGCCCCTCTC 3' (SEQ ID NO: 12), wherein a preferred fragment thereof further comprises

5' CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG 3' (SEQ ID NO: 13).

This method of CYP2A6 genotyping involves a first amplification reaction with F4 and R4 primers, which generates a DNA fragment approximately 7.8 kb in size. This amplification step is facilitated by polymerases which are capable of transcribing long stretches of DNA. To distinguish the CYP26Av1 and CYP26Av2 variant alleles, an exon-specific amplification step is carried out using the 7.8 Kb DNA fragment as template DNA. This may be accomplished using the following primer pair:

The E3F primer (forward) comprises

5' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCGCCAAG CAGCTCCTG 3' (SEQ ID NO: 14), wherein a preferred fragment thereof further comprises

5' GCGTGGTATTCAGCAACGGG 3' (SEQ ID NO: 15).

The E3R primer (reverse) comprises

5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCCCCCGC ACTCGGGATGCG 3' (SEQ ID NO: 16), wherein a preferred fragment thereof further comprises

5' TCGTCCTGGGTGTTTTCCTTC 3' (SEQ ID NO: 17).

Using these primers in a second amplification reaction step a segment of CYP2A6 exon 3 is specifically amplified. The method further comprises use of the restriction endonuclease XcmI to detect the CYP2A6v1 mutation and DdeI to detect the CYP2A6v2 mutation.

According to a yet further aspect of the invention there is provided a kit for performing the afore described methods which kit includes at least a portion of DNA in accordance with the invention and preferably at least one control sample of DNA containing the mutation or mutations of interest and ideally also a wild-type sample of DNA so that suitable comparisons can be made.

It is of note that although the method is described with reference to the above methods, any suitable method using the genetic material of the invention may be used to identify the mutations described herein.

The CYP2C9 assay has been used in a study of warfarin dose requirement in 94 patients undergoing anticoagulant treatment and the results obtained are summarized in FIG. 5. 58 patients (61.7%) were homozygous for the wild-type (Arg₁₄₄) allele and were found to require a median weekly maintenance dose of 31.5 mg of warfarin. 36 patients (38.6%) were heterozygous and required a median weekly maintenance dose of 24.5 mg. The doses required by the two groups were significantly different (Mann-Whitney U-test, p=0.016). No subjects in the group were homozygous for the mutant allele but based on allele frequencies and the Hardy Weinberg equilibrium, the predicted frequency of homozygous mutant subjects is 3.7%.

Comparison of the weekly maintenance dose of warfarin in the R144C heterozygotes (n=36) and homozygous wild-type (n=58) reveals that the heterozygotes required a significantly lower dose (range of 10.5-80.mg). Moreover, of the patients requiring the lowest doses to maintain an anticoagulation target (INR 2.0-4.0), in the range 5-15 mg per week, 9 out of 10 were heterozygous. At the other extreme of weekly doses>55 mg, 5 out of 6 patients were homozygous wild-type for CYP2C9. The significantly lower (20%) warfarin dose requirement of the patients with one variant R144C allele is consistent with the kinetic properties of the R144C protein with respect to (S)-warfarin hydroxylation and presumed in vivo metabolic clearance (Rettie et al. 1994 Pharmocogen., 4:39-42).

The CYP2A6 genotyping assay has been used in studies on coumarin metabolism. Coumarin 7-hydroxylase activity is a convenient marker activity to identify the presence of CYP2A6 in a particular sample. There is considerable variation in the ability of individuals to 7-hydroxylate this compound which is a reaction specific for CYP2A6. A subject deficient in coumarin 7-hydroxylation has been identified. This subject is homozygous for the mutant CYP2A6v1 allele confirming the previous in vitro findings that substitution of Leu160 by His results in loss of coumarin 7-hydroxylase activity. As shown in FIG. 6, CYP2A6 genotyping and phenotyping with coumarin has been performed on other members of the proband's family and impaired coumarin 7-hydroxylation has been observed in heterozygotes for the CYP2A6v1 mutation.

The genotyping assays described herein resulted from a two step amplification reaction wherein first amplification reaction amplifies a 7.8 Kb fragment containing the CYP2A6 gene (FIG. 9A) and a second amplification reaction amplifies an exon-specific fragment of CYP2A6. The amplification product was digested with restriction endonucleases producing different patterns for the various CYP2A6 alleles. Representative results obtained for several human subjects for the detection of the CYP2A6v1 (XcmI digestion) and CYP2A6v2 (DdeI digestion) are shown in FIG. 9 panel B. A schematic depiction of this genotyping assay is shown in FIG. 9, panel C. Of 155 human genomic DNA samples analyzed 21 heterozygous (+/-) and 6 homozygous (-/-) subjects were detected for the CYP2A6v1 allele, whereas 17 heterozygous (+/-) and no homozygous were identified for the CYP2A6v2 allele variant. Additionally, 7 homozygous for both CPYP2A6v1 and CYP2A6v2 alleles were found.

Allelic frequencies were calculated for either allele in several ethic groups and analyzed as shown in Table 1. CYP2A6v1 frequency is almost identical between Caucasian and Japanese, and it is only twice the frequency in Taiwanese samples. Significantly, this allele is completely absent in the African-American population within the samples studied. The Japanese population has a remarkable higher frequency for the CYP2A6v2 allele (28%) as compared to the Caucasian (2%), Taiwanese (6%) or African-American (2.5%) (ethnic groups).

                  TABLE 1     ______________________________________     Allelic frequency for the CYP2A6 gene in     different ethnic groups.              Allelic Frequencies (%)     Ethnic Group                CYP2A6   CYP2A6v1   CYP2A6v2                                            N     ______________________________________     Caucasian  75       23         2       52     Japanese   52       20         28      40     Taiwanese  83       11         6       178     African-American                97.5     0          2.5     40     ______________________________________

The following examples illustrate various aspects of the present invention and in no way are intended to limit the scope thereof. All books, articles, and patents referenced herein are incorporated herein, in toto, by reference. Other similar embodiments will be clear to the skilled artisan and are encompassed within the spirit and purview of the present invention.

EXAMPLE 1

Method for determining the genotype CYP2C9

Genotyping for the CYP2C9 polymorphism is carried out by amplification by PCR followed by digestion with the restriction endonuclease AvaII. Amplifications are performed in 0.5 ml microcentrifuge tubes in a volume of 100 μl containing 10 mM Tris-HCl, pH 8.8, 1.5 mM MgCl2. 50 mM KCl, 0.1% Triton X-100, 5% dimethylsulphoxide, 200 μM each of dTTP, dATP, dCTP and dGTP, 250 μM of the primers HF18 and HF2R, 2.5 units Taq polymerase and 1 μg human leukocyte genomic DNA. PCR conditions consist of 35 cycles with a denaturation at 93° C. for 1 min. annealing at 55° C. for 1.5 min and polymerization at 72° C. for 1 min. 20 μl of the amplified DNA is incubated with 10 units AvaII for 3 h at 37° C. and then analyzed by electrophoresis on 1.8% agarose minigels in TBE (90 mM Tris-borate, 2 mM EDTA) buffer. The digestion products are visualized by ethidium bromide staining. DNA from individuals positive for the wild-type Arg,44 is digested to give fragments of 270 bp and 50 bp whereas in individuals with the mutant Cys₁₄₄ present, a band of 320 bp is seen due to loss of an AvaII site (FIG. 3).

EXAMPLE 2

Genotyping for the CYP2C9 polymorphism was carried out by amplification by PCR followed by digestion with the restriction endonuclease AvaII.

One hundred patients were recruited from two anticoagulation clinics in the Newcastle area over four study days. Body weight and height were measured, the basal metabolic index ("BMI") calculated for each patient and details of age, sex, drug history, current and previous International Normalized Ratio ("INR") determinations, indications for anticoagulation and other significant health problems were all recorded. DNA was isolated by a standard manual chloroform-phenol extraction procedure and 1 μg was subjected to PCR analysis. As shown in FIG. 10 the C→T substitution, which converts Arg→144 to Cys, resides in exon 3 of the CYP2C9 gene and results in the loss of an AvaII restriction site (. . . GAGGACCGTGTTCAA . . . ) in the R144C allele (. . . GAGGACTGTGTTCAA . . . ). This provided the basis of the amplification strategy. A CYP2C9 specific intron forward primer (HF18, TGCAAGTGCCTGTTTCAGCA, FIG. 10) and a CYP2C9 exon 3 3'-end reverse primer (HF2R, AGCCTTGGTTTTTCTCAACTC, FIG. 10) were used at a concentration of 250 μM each. Amplifications were performed in a volume of 100 μl containing 20 mM Tris HCl (pH 8.3), 1.5 mM MgCl₂, 25 mM KC1, 0.05% (w/v) Tween 20, 10 μg gelatin/ml, 2% (w/v) DMSO, 200 μM each of dATP, dCTP, dGTP and dTTP and 2.5 units of Taq DNA polymerase (Perkin-Elmer). Reactions were carried out for 35 cycles at an annealing temperature of 55° C. for 90 sec, a polymerase temperature of 72° C. for 1 min, and a heat denaturing temperature of 93° C. for 1 min, using a Perkin-Elmer Cetus DNA thermal cycler. The PCR products digested with AvaII and sized using NuSieve agarose gels (3% NuSieve, 0.75% agarose). Presence of the CYP2C9 wild-type and R144C alleles were detected as fragments of 50+270 bp and 320 bp respectively (see FIGS. 3). The PCR product synthesized from human genomic DNA with the primers HF18/HF2R was directly sequenced on an ABI 373A automatic sequencer. Briefly, the PCR product was first purified by using the Wizard DNA clean-up system (Promega Co., Madison, Wis.). The purified template was then subjected to dideoxy terminator cycle-sequencing with the primers HF18 and HF2R. The primer-extended products were purified and sequenced following the manufacturer's procedure. Sequence analysis was done by using the MacVector software program (Eastman-Kodak Co., Rochester, N.Y.).

DNA was obtained from 94 patients. Of these 58 (62%) were homozygous for the wild-type CYP2C9 gene and 36 (38%) were heterozygous for the R144C allele. No R144C homozygotes were found. The frequency of the wild-type (Arg-144) and R144C (Cys-144) alleles in the study population is thus 0.808 and 0.192 respectively. An expectation of 3.7% R144C homozygotes can be anticipated from the Hardy-Weinberg equilibrium, but the 95% confidence interval in this estimation of 0.8-8.4% and thus the finding of zero homozygotes in 94 patients is not significantly different from expectation. The specificity of the PCR reaction with respect to the CYP2C9 gene was confirmed by sequencing. The alignment of the sequence obtained from the PCR product with that corresponding to the CYP2C9 gene showed a 100% degree of homology. Interestingly, a heterozygous pattern was obtained for the R144C allelic variant, confirming the high frequency of this allele within the normal population. No sequence deriving from CYP2C9, CYP2C18 or CYP2C19 was found confirming the specificity of the assay for CYP2C9.

EXAMPLE 3

Method for determining the genotype CYP2A6

Genotyping for the CYP2A6 polymorphism is carried out by allele-specific PCR using two parallel PCR reactions, one specific for the wild-type allele, one for the mutant allele. Amplifications are performed in 0.5 ml microcentrifuge tubes in a volume of 45 μl containing 10 mM Tris-HCl, pH 8.8, 1.5 mM MgCl₂, 50 mM KCl, 0.1% Triton X-100, 5% dimethylsulfoxide, 200 μM each of dTTP, DATP, dCTP and dGTP, 250 μM of the primers B and either J51 or J61, 1.25 units Taq polymerase and 1 μg human leukocyte genomic DNA. PCR conditions consist of 40 cycles with a denaturation at 93° C. for 1 min., annealing at 57° C. for 2 min and polymerization at 70° C. for 2 min. The products are analyzed by electrophoresis on 1% agarose minigels in TBE buffer and DNA is visualized by staining with ethidium bromide. As shown in FIG. 4, there are three possible results: the individual may be homozygous for the wild-type allele and give a DNA product only for the PCR reaction with primer J51, the individual may be heterozygous with one wild-type and one mutant allele and give DNA products with both primers J51 and J61 or the individual may be homozygous for the mutation and give a DNA product only with the J61 primer.

EXAMPLE 4

Alternative Method for Determining the Genotype CYP2A6

For use of F4 and R4 primers, each reaction mixture contained 600 ng human genomic DNA, 0.2 μM of each primer, 200 μM dNTP's, 0.8 mM magnesium acetate and 2 units of rTth I DNA polymerase. Hot start was as indicated by the manufacturer (Perkin Elmer) and the amplification reaction of 31 cycles of 93° C., 1 min; 66° C., 6 min 30 sec. Amplification products were analyzed in 0.7% agarose gels and the DNA visualized by staining with ethidium bromide. For the exon 3 specific amplification, the reaction which uses, the primers E3F and E3R consist of 5 μl of the 7.8 Kb PCR reaction, 0.5 82 M of each primer, 200 μM dNTP's, 1.5 μM MgCl₂ and 2.5 units of Taq DNA polymerase. The amplification reaction consisted of 94° C. for 3 minutes followed by 31 cycles of 94° C., 1 minute; 60° C., 1 minute and 72° C., 1 minute.

Amplification products were then digested without purification with restriction endonucleases which detect the CYP2A6 wild type (no digestion), CYP2A6v1 (XcmI) and CYP2A6v2 (DdeI). DNA was visualized by use of ethidium bromide after electrophoresis in 1% agarose, 3% NuSieve agarose.

It is of note that CYP2C9 genotyping can be performed using an allele-specific assay similar to that used above for CYP2A6.

    __________________________________________________________________________     SEQUENCE LISTING     (1) GENERAL INFORMATION:     (iii) NUMBER OF SEQUENCES:17     (2) INFORMATION FOR SEQ ID NO:1:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 1740 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: double     (D) TOPOLOGY: linear     (ix) FEATURE:     (A) NAME/KEY: CYP2A6v2     (B) LOCATION:     (D) OTHER INFORMATION:     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     ATGCTGGCCTCAGGGATGCTTCTGGTGGCCTTGCTGGCCT40     GCCTGACTGTGATGGTCTTGATGTCTGTTTGGCAGCAGAG80     GAAGAGCAAGGGGAAGCTGCCTCCGGGACCCACCCCATTG120     CCCTTCATTGGAAACTACCTGCAGCTGAACACAGAGCAGA160     TGTACAACTCCCTCATGAAGATCAGTGAGCGCTATGGCCC200     CGTGTTCACCATTCACTTGGGGCCCCGGCGGGTCGTGGTG240     CTGTGTGGACATGATGCCGTCAGGGAGGCTCTGGTGGACC280     AGGCTGAGGAGTTCAGCGGGCGAGGCGAGCAAGCCACCTT320     CGACTGGGTCTTCAAAGGCTATGGCGTGGTATTCAGCAAC360     GGGGAGCGCGCCAAGCAGCTCCTGCGCTTTGCCATCGCCA400     CCCTGAGGGACTTCGGGGTGGGCAAGCGAGGCATCGAGGA440     GCGCATCCAGGAGGAGTCGGGCTTCCTCATCGAGGCCATC480     CGGAGCACGCACGGCGCCAATATCGATCCCACCTTCTTCC520     TGAGCCGCACAGTCTCCAATGTCATCAGCTCCATTGTCTT560     TGGGGACCGCTTTGACTATAAGGACAAAGAGTTCCTGTCA600     CTGTTGCGCATGATGCTAGGAATCTTCCAGTTCACGTCAA640     CCTCCACGGGGCAGCTCTATGAGATGTTCTCTTCGGTGAT680     GAAACACCTGCCAGGACCACAGCAACAGGCCTTTCAGTTG720     CTGCAAGGGCTGGAGGACTTCATAGCCAAGAAGGTGGAGC760     ACAACCAGCGCACGCTGGATCCCAATTCCCCACGGGACTT800     CATTGACTCCTTTCTCATCCGCATGCAGGAGGAGGAGAAG840     AACCCCAACACGGAGTTCTACTTGAAGAACCTGATGATGA880     GCACGTTGAACCTCTTCATTGCAGGCACCGAGACGGTCAG920     CACCACCCTGCACTATGGCTTCTTGCTGCTCATGAAGCAC960     CCAGAGGTGGAGGCCAAGGTCCATGAGGAGATTGACAGAG1000     TGATCGGCAAGAACCGGCAGCCCAAGTTTGAGGACCGGGC1040     CAAGATGCCCTACATGGAGGCAGTGATCCACGAGATCCAA1080     AGATTTGGAGACGTGATCCCCATGAGTTTGGCCCGCAGAG1120     TCAAAAAGGACACCAAGTTTCGGGATTTCTTCCTCCCTAA1160     GGGCATAGAAGTGTTCCCTATGTTGGGCTCCGTGCTGAGA1200     GACCTCAGGTTCTTCTCCAACCCCCGGGACTTCAATCCCC1240     AGCACTTCCTGGGTGAGAAGGGGCAGTTTAAGAAGCGTGA1280     TGCTTTTGTGCCCTTCTCCATCAGAAAGCGGAACTGTTTC1320     GGAGAAGGCCTGGCCAGAATGGAGCTCTTTCTCTTCTTCA1360     CCACCGTCATGCAGAACTTCCGCCTCAAGTCCTCCCAGTC1400     ACCTAAGGACATTGACGTGTCCCCCAAACACGTGGGCTTT1440     GCCACGATCCCACGAAACTACACCATGAGCTTCCTGCCCC1480     GCTGAGCGAGGGCTGTGCCGGTGAAGGTCTGGTGGGCGGG1520     GCCAGGGAAAGGGCAGGGCCAAGACCGGGCTTGGGAGAGG1560     GGCGCAGCTAAGACTGGGGGCAGGATGGCGGAAAGGAAGG1600     GGCGTGGTGGCTAGAGGGAAGAGAAGAAACAGAAGCGGCT1640     CAGTTCACCTTGATAAGGTGCTTCCGAGCTGGGATGAGAG1680     GAAGGAAACCCTTACATTATGCTATGAAGAGTAGTAATAA1720     TAGCAGCTCTTATTTCCTGA1740     (2) INFORMATION FOR SEQ ID NO:2:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 1737 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: double     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: cDNA     (ix) FEATURE:     (A) NAME/KEY: CYP2A13     (B) LOCATION:     (D) OTHER INFORMATION:     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     ATGCTGGCCTCAGGGCTGCTTCTGGTGACCTTGCTGGCCT40     GCCTGACTGTGATGGTCTTGATGTCAGTCTGGCGGCAGAG80     GAAGAGCAGGGGGAAGCTGCCTCCGGGACCCACCCCATTG120     CCCTTCATTGGAAACTACCTCCAGCTGAACACAGAGCAGA160     TGTACAACTCCCTCATGAAGATCAGTGAGCGCTATGGCCC200     TGTGTTCACCATTCACTTGGGGCCCCGGCGGGTCGTGGTG240     CTGTGCGGACATGATGCCGTCAAGGAGGCTCTGGTGGACC280     AGGCTGAGGAGTTCAGCGGGCGAGGCGAGCAGGCCACCTT320     CGACTGGCTCTTCAAAGGCTATGGCGTGGCGTTCAGCAAC360     GGGGAGCGCGCCAAGCAGCTCCGGCGCTTCTCCATCGCCA400     CCCTAAGGGGTTTTGGCGTGGGCAAGCGCGGCATCGAGGA440     ACGCATCCAGGAGGAGGCGGGCTTCCTCATCGACGCCCTC480     CGGGGCACGCACGGCGCCAATATCGATCCCACCTTCTTCC520     TGAGCCGCACAGTCTCCAATGTCATCAGCTCCATTGTCTT560     TGGGGACCGCTTTGACTATGAGGACAAAGAGTTCCTGTCA600     CTGTTGCGCATGATGCTGGGAAGGTTCCAGTTCACGGGAA640     CCTCCACGGGGCAGCTCTATGAGATGTTCTCTTCGGTGAT680     GAAACACCTGCCAGGACACAGCAACAGGCCTTTAAGGAGC720     TGCAAGGGCTGGAGGACTTCATCGCCAAGAAGGTGGAGCA760     CAACCAGCGCACGCTGGATCCCAATTCCCCACGGGACTTC800     ATCGACTCCTTTCTCATCCGCATGCAGGAGGAGGAGAAGA840     ACCCCAACACAGAGTTCTACTTGAAGAACCTGGTGATGAC880     CACCCTGAACCTCTTCTTTGCGGGCACTGAGACCGTGAGC920     ACCACCCTGCGCTACGGTTTCCTGCTGCTCATGAAGCACC960     CAGAGGTGGAGGCCAAGGTCCATGAGGAGATTGACAGAGT1000     GATCGGCAAGAACCGGCAGCCCAAGTTTGAGGACCGGGCC1040     AAGATGCCCTACACAGAGGCAGTGATCCACGAGATCCAAA1080     GATTTGGAGACATGCTCCCCATGGGTTTGGCCCACAGGGT1120     CAACAAGGACACCAAGTTTCGGGATTTCTTCCTCCCTAAG1160     GGCACTGAAGTGTTCCCTATGCTGGGCTCCGAGCTGAGAG1200     ACCCCAGGTTCTTCTCCAACCCCCAGGACTGCAGTCCCCA1240     GCACTTCCTGGATGAGAAGGGGCAGTTTAAGAAGAGTGAT1280     GCTTTTGTGCCCTTTTCCATCGGAAAGCGGTACTGTTTTG1320     GAGAAGGCCTGGCCAGAATGGAGCTCTTTCTCTTCTTCAC1360     CACCATCATGCAGAACTTTCGCTTCAAGTCCCCTCAGTCG1400     CCTAAGGATATCGACGTGTCCCCCAAACACGTGGGCTTTG1440     CCACGATCCCACGAAACTACACCATGAGCTTCCTGCCCCG1480     CTGAGCGAGGGCTGTGCTGGTGCAGGGCTGGTGGGCGGGG1520     CCAGGGAAACGGCCGGGGCAGGGGCGGGGCTTGTGGGAGG1560     GGCGGGGCTAAGAATGGGGGCAGTGGGGGAAGGAAGGGGA1600     GAGGTGGTTAGAGGGAACAGAAGAAACAGAAGGGGCTCAG1640     TTCACCTTGATGATGTCCTTCAGAGCTGTGATGAGAGGAA1680     GGGAAACCTTACAGTATGCTACAAAGAGTAGTAATAATAG1720     CAGCTCTTATCTCCTGA1737     (2) INFORMATION FOR SEQ ID NO:3:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 7216 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: double     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: genomic DNA     (ix) FEATURE:     (A) NAME/KEY: CYP2A6v2     (B) LOCATION:     (D) OTHER INFORMATION:     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     AAGTTCCCCTGAAATATGGCTCTGGTCTTCCTCCCCTTGC40     CAATGAAGAAGATGGCAGTGGAGGTTCTATGGCAGCCATC80     CTGGCCTCACTCTGAGGTTCCAATGAGGATTCTGGGCATC120     AAGAGACAGCTCTGGGCAAAGCTAAATCAAGTCAGCCCCT160     GGACCCAGTGCTGGGCTGCTGGGCTTTCTGGGAGAACGCC200     GCTGGGCTTGCTACACACTCCTCCTCCCAGAAACTCCACA240     CCCACAGCCCTGGGTCTTCCTAGCCCCGAGACTTTCAAGT280     CCATATGCCTGGAATCCCCCTTCCTGAGACCCTTAACCCT320     GCATCCTCCACAACAGAAGACCCCTAAATGCACAGCCACA360     CTTTGTCTTACCCTAATAAAACCCAGACCTTTGGATTCCT400     CTCCCCTGGAACCCCCAGATCCGCACAACTTTGGGGTGCA440     TTCTCACTCTCAGACCCCAAATCCAAAGCCCAAGTGCTCC480     CCTATGCAAATATTCCAAACTCCTCAGTTCTACAGCTTAT520     CTGTTGCCCCCTCCTAAATCCACAGCCCTGCGGCACCCCT560     CCTGAAGTACCACAGATTTAGTCTGGAGGCCCCCTCTCTG600     TTCAGCTGCCCTGGGGTCCCCTTATCCTCCCTTGCTGGCT640     GTGTCCCAAGCTAGGCAGGATTCATGGTGGGGCATGTAGT680     TGGGAGGTGAAATGAGGTAATTATGTAATCAGCCAAAGTC720     CATCCCTCTTTTTCAGGCAGTATAAAGGCAAACCACCCCA760     GCCGTCACCATCTATCATCCCTCTACCACCATGCTGGCCT800     CAGGGATGCTTCTGGTGGCCTTGCTGGCCTGCCTGACTGT840     GATGGTCTTGATGTCTGTTTGGCAGCAGAGGAAGAGCAAG880     GGGAAGCTGCCTCCGGGACCCACCCCATTGCCCTTCATTG920     GAAACTACCTGCAGCTGAACACAGAGCAGATGTACAACTC960     CCTCATGAAGGTGTCCCAAGACAGGGAGATGGGTGTCTCG1000     GGGTGGGGGCTGCCTAGTTGGCTGGGGCTTTGTGGCAGGG1040     GGTTGACCAGTGTGGACCAGAGTCTTAGGAAATGGAGTTT1080     TGGAGTTTCAGCATCAGAAAGACAGGATCTTGGGATGTCC1120     AGCTCCCTGACTGTGAGAACCTGGGTGCGAAGCATCCCAG1160     CACATGACATCTCGGTGCTGGGCCCCATTCAGAGTGGAGG1200     GTTCTCCCTCTAACCACTCCCACCCACCTCCATCAGATCA1240     GTGAGCGCTATGGCCCCGTGTTCACCATTCACTTGGGGCC1280     CCGGCGGGTCGTGGTGCTGTGTGGACATGATGCCGTCAGG1320     GAGGCTCTGGTGGACCAGGCTGAGGAGTTCAGCGGGCGAG1360     GCGAGCAAGCCACCTTCGACTGGGTCTTCAAAGGCTATGG1400     TGCCCAAGAGGGGGAAGGTGGGCAGGTGGACACGAAGGTC1440     TCAGTGTTCCCAGCCTTCTCCCTGACTCTCCTGACAACTG1480     GAGGATAAGGGAGAGTCCCCAGTCTGGTCTTCCCTCCCCA1520     TCTCCCTACATTGGGGCCTCTCCATGTGTATCCCTCACCT1560     GTCTCCAGCGGCCCTGTCCTGATTCCTCCCTGCCTCTCTC1600     TGCCCCACCTCCTTATTCTCTCTCACTGGAGTCTCCTCTT1640     TCCCCTCTCTCTCCATCTCTAAGGACATCCTGGGTTTCTG1680     TTTACCAGCCCTGGGTCTCTGTCTACATGAGTCTTTGAGG1720     CCCTCTTAGCTTCTGGGCTTCTCTGGGTTTCTCATCTCTC1760     CGGATCCCTTTCTCAATTCTTCCTCTGTCTTAGGATGCCA1800     GGGTTATTCCTACTTCCACATCTTCAGGCTCCATCTCCTG1840     GTAACAGTCTCTCTTCCTTCCAGACCCTCTCTGTTTCTAT1880     CTCAATATTAAACTCTCTGCTCCAGCTCAGCTTAAGAATC1920     TCACACCAAGAGAGGATGTCCTCCACCCAGATCTCCCCAT1960     ATCTCACTACCCCACCCTCCATCCTCTGCCTCCATCACTC2000     TCTTTCTCTCCCCACTGCNCCTGCGGACGCGATCCAATGG2040     AGTGTGGAGCTAATGCCGTGAAGCTATGTGCATCTCTCTG2080     TCTGGCCGTACCTGGGTAATAACCTGATCGACTAGGCGTG2120     GTATTCAGCAACGGGGAGCGCGCCAAGCAGCTCCTGCGCT2160     TTGCCATCGCCACCCTGAGGGACTTCGGGGTGGGCAAGCG2200     AGGCATCGAGGAGCGCATCCAGGAGGAGTCGGGCTTCCTC2240     ATCGAGGCCATCCGGAGCACGCACGGTGAGCAGGGGACCC2280     CGAGTGCGGGGGCAGGAGAAGGAAAACACCCAGGACGAGG2320     AACCCGCGCGCGTTCTGCCTGGGGATGGGGACTAGGTGGG2360     GAAAGGCGCCCGCACTTCCAGCCCTGGAGTCTGGCGCTGG2400     GAATTTGGCTCAACAAGGCCCTGCCTCCTGGAATTCTGAC2440     TCTCCTCAGACCTCTGAGTTGACTCTCTCCCCAACCCCCT2480     TCTCCCGACATACCCGGAGGCGCCAATATCGATCCCACCT2520     TCTTCCTGAGCCGCACAGTCTCCAATGTCATCAGCTCCAT2560     TGTCTTTGGGGACCGCTTTGACTATAAGGACAAAGAGTTC2600     CTGTCACTGTTGCGCATGATGCTAGGAATCTTCCAGTTCA2640     CGTCAACCTCCACGGGGCAGGTAATGGTTGCAGCCCGGCC2680     CGTGAAGGCCCTTACCAAAACCGGCAAATTGTTCCCCTAC2720     CGGGGGAAGGGGGCCCCAAATTCCCACCGCCCCCCGGACA2760     GTGTCCCCTCAAAATCAGTCCCCGATTTGGGCAAATTGGC2800     AGAGTGGAACCAGACCCGGGTTGGTTGTCCAATCCCCTGC2840     TCTCCAGGGACACCGGGATAGCACAACAGATGCTCCCCAA2880     AACAGAGCCTGCTGGCAGGATGCATACCCTCAGCTCAGCT2920     CTCTCACCCTGGGCACGTGTTCCCATCCCCAACTTACCGG2960     TAATTTCTAACAGATGCTCCCTACCCAGGTCTTCTTGAAT3000     ATTTTAACACCCGGAAACCCTGGGTACCTAACCTTCCCTG3040     TAAACTTTAGAGATTAGTTCCTATCCGGCCCCTCTGAAAT3080     ACCTAACCACCGGAGACCAGATGCCTTTAACTCAGTTCCT3120     TCCTTGCTATGAAACAAATCCCATTCCCATCAGCTCCTGC3160     CCCGTGACAGCTGTCCTTCCCTTCCCATCCTCTCTCTGCA3200     ACCCCAGCTCTATGAGATGTTCTCTTCGGTGATGAAACAC3240     CTGCCAGGACCGCAGCAACAGGCCTTTCAGTTGCTGCAAG3280     GGCTGGAGGACTTCATAGCCAAGAAGGTGGAGCACAACCA3320     GCGCACGCTGGATCCCAATTCCCCACGGGACTTCATTGAC3360     TCCTTTCTCATCCGCATGCAGGAGGTACACCCCAGCAGCC3400     ACTGCGGGGAGATGCAAAGCCAGGCAGAGGGAAATCAGTC3440     TGGGAGTGGGGCAGGCAGATGACACAGGCCCATTCAAATT3480     AACCCTCATCATAATAATCCTCACAATTGGCTGGGTGCCG3520     TGGCTAACAGCCTGTAATCCCAGCACTTTGGGAGGCCGAG3560     GCAGGTGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCT3600     GGCCAACATGGTCAAACCCCGTCTCTACTAAAAATCCAAA3640     AATTAGTTGGGCATGGTGGCGCGAAGGGGGGCAGAGGTTG3680     CAATGAGCCAAGATCACGGCATTGCACTCCAGTCTGGGTG3720     ACAGAATGAGGCCCTGTGTCAAAAAAAATTAATCACTTGT3760     TTAAAAAGTAAGTGAGCCTGCATGGTCATGCGCATGTGCA3800     GCTCCAGCTACTCAGGAGGCTGAGGCTGGAGGATTGCTTG3840     AGCTCAGGAGTTGGCGTCCGGCCTGTGCAACTTAGCAAGA3880     CCAAGTCAGTATAAGAAAAAAAAAAAACAAAAAAAAAGCT3920     GACAGCTAAGTTGATAATTGACGGACAGATGGTCAGCAAG3960     GTAACGAAGGTGAGAAGGAAGAGCATTGGGGGCAACGCCA4000     GGAGTCAGGGCAAGGGCTGGTTCCTAGAGCGAGTCTGGTA4040     GGATCTAGGGCCCCTCTTCTCCACCCTGCGGTCTTGCCCC4080     AAAGAGAGGTCGAGGGTGCTGGGATTGCGCTAGACTCGAG4120     TCTGTGTAGATCTTGGGGTCCCCTCTTGACCCCCATTGGT4160     CTGAACCTAAGAGTGGAAGATCCATGGGGTGAACCCCTAG4200     ATGGTGCCCTGAGGTCAAGCAGGAGTGAGGTTGTCCTAAA4240     GCCCCCTCTCCCTTCAGGAGGAGAAGAACCCCAACACGGA4280     GTTCTACTTGAAGAACCTGATGATGAGCACGTTGAACCTC4320     TTCATTGCAGGCACCGAGACGGTCAGCACCACCCTGCACT4360     ATGGCTTCTTACTGCTCATGAAGCACCCAGAGGTGGAGGG4400     TAAGGCTGGAGGGGGACGGAAGTGGAGGGCCCCAGACCCT4440     CAAAATTCCCCTTCGACTGGTGCAATGTCCCCACCTGTCC4480     CAGATCCCGGGACCCTGAGACGTGACTTGCTGTCCAGAGA4520     CAGGGCAACATTCAGCTGGTAGGCATCAGCTGAGTCTCAT4560     TAGATATTAAAATATTGAAAATGTCTGCACTGATTGGTCA4600     GTCACTTCTGTCCCAAGCCCACTGAGTGCCCACTGCCCGT4640     TCCACCGGGTCATCCCCTAAGTTCCTCCCTGTGCCTCCCC4680     TGTGATTCTGGCACAACCTGGTTAACAGGATCCTACTCCA4720     ACAATGCGAATGGGTGATGTCTGTTCTGTTATGAATGCTC4760     TACTTCCGTCTCATAGGCGGAGGCATTTCATCCACCCCAT4800     TTTGCCTATCCGGACTATCATTTCCTGCTCTGAGACCCCT4840     AGATACCTAAACACATTCCCCCTCCTCCCCCAGCCAAGGT4880     CCATGAGGAGATTGACAGAGTGATCGGCAAGAACCGGCAG4920     CCCAAGTTTGAGGACCGGGCCAAGATGCCCTACATGGAGG4960     CAGTGATCCACGAGATCCAAAGATTTGGAGACGTGATCCC5000     CATGAGTTTGGCCCGCAGAGTCAAAAAGGACACCAAGTTT5040     CGGGATTTCTTCCTCCCTAAGGTGCTATCCGCCCCCACCC5080     CCCAGACTACGGGGACTCCAGCCCCTCTCTGTGTCCCCAG5120     CATCCCACCCACATTAGAAGCTTTCTAGACCCTGTCCCAC5160     TCCCTCAATCAGTCAAAAAAGACTTCCCCAACCACCACAT5200     CCGTTCCACCTTTCCACTTAGACACTCCTGAGTCCTGCAT5240     CTCTCCAGACTCTTTGTGTCAGGAGAATCAAACACATGTT5280     CCCAAACTTCCTATCTTAAGAAACAGAAGCCCCCTTTCCA5320     TTCGGCCTTTTGTCATAGGGACAGAAATCTCAGGTCCCCC5360     AAACTCCTGCCTAGAAGGACATGGACCCCATGTCTCCCAA5400     ACTTCCTGTTTCAGAGATGTGAACCTTCTATCCCCCAAGG5440     TCCTCCCTCAGAGGTCCCCAATTCCCATGCCTGCCACTTC5480     CCCTCACCGGGGCACCCTAGTTCCCCCTCCAGCCCCTGTG5520     TACTCTCAACAATCCCCCAACCCGCCTCATCACATACACC5560     TTCCTCCTCCCTCCCAGGGCATAGAAGTGTTCCCTATGTT5600     GGGCTCCGTGCTGAGAGACCTCAGGTTCTTCTCCAACCCC5640     CGGGACTTCAATCCCCAGCACTTCCTGGGTGAGAAGGGGC5680     AGTTTAAGAAGCGTGATGCTTTTGTGCCCTTCTCCATCAG5720     TAAGAGACCACTGTTTGGTGCCAGGCTTACTACTCACACC5760     AGCAGGGGCCTCCCTTACCCAGTTCCCCTCTCTGCCGTGT5800     AGCCTAGTATTTCCCCAGCTTGGCAAGTTCCTGTTAGCAA5840     TCTACCGTCGAGCCACCAGGTGATACTCCCTTAACTACCA5880     AGCACCCAGTACCTGTGCCCAGGCAAAAGGAAAGGAAACA5920     TCATACCCCTTTCAGAGGCGGGGGAAAACCAAAGGCCAGA5960     GAGAATCAGAGATTTATTTCCCTAGGGTCACACAGGAGAT6000     TCTTCAGCATCCCTAAAAAGGAGATGACGGCACAGCAGGT6040     CATATTTGGGAGTTCTTATCTGGGGGAAGGGGGATCTTAA6080     ACCTCCCATTGTGGACACCTGGCATCGATCAACCCCATCT6120     TTTGGTCATCTTTTGGGTCACTCAAGGAAACTGAGGTCAA6160     GGAGGGTCAAGAGGCTCCCTCTTAAAGTCTCTCAGGGCCA6200     TATATTCCACCCTTCCTCCCTGGGAGAGCCGCAGCTGGAG6240     GTCGGTACTGGGGCGAGGCTGCACTGAGAGTGGGCTTCAC6280     CTCCACCCCTCCCGCCTCTCCTCCTCAGGAAAGCGGAACT6320     GTTTCGGAGAAGGCCTGGCCAGAATGGAGCTCTTTCTCTT6360     CTTCACCACCGTCATGCAGAACTTCCGCCTCAAGTCCTCC6400     CAGTCACCTAAGGACATTGACGTGTCCCCCAAACACGTGG6440     GCTTTGCCACGATCCCACGAAACTACACCATGAGCTTCCT6480     GCCCCGCTGAGCGAGGGCTGTGCCGGTGAAGGTCTGGTGG6520     GCGGGGCCAGGGAAAGGGCAGGGCCAAGACCGGGCTTGGG6560     AGAGGGGCGCAGCTAAGACTGGGGGCAGGATGGCGGAAAG6600     GAAGGGGCGTGGTGGCTAGAGGGAAGAGAAGAAACAGAAG6640     CGGCTCAGTTCACCTTGATAAGGTGCTTCCGAGCTGGGAT6680     GAGAGGAAGGAAACCCTTACATTATGCTATGAAGAGTAGT6720     AATAATAGCAGCTCTTATTTCCTGAGCACGTACCCCCGTG6760     TCACCTTTGTTCAAAAACCATTGCACGCTCACCTAATTTG6800     CCACAAAACCCCCTTCGAAGGGGCGTTCATGCCCATTTTA6840     CACGTGACAAAACTGAGGCTTAGAAAGTTGTCTCTGATGT6880     CTCACAAAACATAAGTGCCCAGAAAATCTGCGAACACAGA6920     TCTGTGCCCATAGCCTTCTAGACAGATTCTTAAAAAGCAC6960     CTATTCCTCACGCAAAACAGTTTAGTATAGAATCACATGG7000     CCTGAACATCCCTGTCCGGGGGAGTTCCCCAGAGACCTGG7040     GGGGTGGTTGCCCTGCCTTCACTGCACACATGCCCACACT7080     CTCACCTACTCAACATGCTGTGACTACCCGGGTGTAATCT7120     GTGCTTGCTACCAGATAAGGCCACTGTAGCCCATTCAGAG7160     TCAGCCCAGGGACACAACGAGACATGACTGGACATACAGG7200     GTCAGTCCATTAACAA7216     (2) INFORMATION FOR SEQ ID NO:4:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 8779 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: double     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: genomic DNA     (ix) FEATURE:     (A) NAME/KEY: CYP2A13     (B) LOCATION:     (D) OTHER INFORMATION:     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     TGGGTCCAACCAAAATCAAAGTCAATTCCCTGGGCCCAGT40     GCTGGGCTGCTGGGCTTTTCTGGGAGCACCTGCTGGGCTT80     GCTACACACTCCACCTCCCAGAAACTCCACACCCACAGCC120     CTGGGTCTTCCTAGCCCCAAGACTTTCAAGTCCATATGCC160     TGGAATCCCCCGTCCTGAGACCCTTAACCCTGCATCCTCC200     ACAACAGAAGACCCCCAGATGCACAGCCACACTTCCATCT240     CACCCTAGTAAAACCCAGACCTTTGGATTCCTCTCCCGTG280     GAATGCCCAAATCCACAACTTTGGGGTGCAGTCTCACTCC320     CAGATCCCAAATCCAAAGACCAGGTGCTCCCCTGTGCAAA360     TATTCCAAACTCCTCAGTTCCACAGTTTATCTGTTGCCCG400     CTCCTAAATCCACAGGCCCTGCAGCAACCCTCCTGAAGTA440     GCAGAGTTAGACTGGAGTTCCCCTCCCTGTTCATCTTGCC480     CTGGGGTCCCTCTCCTCCTCCCTTGCTGGCTGTGTCCTAA520     GCTGTGTGGGATTCAGGGTTGGGGTGTAGTTGGGAGGTGA560     AATGAGGTGATTATATAATCAACCACAGTCCATCCCTCTT600     TTTCAGGCAGTATAAAGGCAAACCACCCCAGCCATCACCA640     TCTATCATCCCATGGCCACCATGCTGGCCTCAGGGCTGCT680     TCTGGTGACCTTGCTGGCCTGCCTGACTGTGATGGTCTTG720     ATGTCAGTCTGGCGGCAGAGGAAGAGCAGGGGGAAGCTGC760     CTCCGGGACCCACCCCATTGCCCTTCATTGGAAACTACCT800     CCAGCTGAACACAGAGCAGATGTACAACTCCCTCATGAAG840     GTGTCCTAAGGCAGGAGATGGGTGGCACGGGGTGGGGGCT880     GCCCAGTTGGCTGGGGCTTAGTGGCAGGGGATTGACCAGT920     GTGGACCAGAGTCTTAGGAAAGGGAGTCTTGGAGTTTCAG960     CATCAGGGTCCTAGCAGGAAAGACAGGATCTTGGGATGTC1000     CAGCTCCCTGACTGTGAGAACCTGGGGGGCGAAGCATCCC1040     AGTACATGATATCTCAGCGCTGGGCCCATTCAGAGTGGGG1080     GCTGCTCCCTCTAACCACTCCCACCTGCCTCCAACAGATC1120     AGTGAGCGCTATGGCCCTGTGTTCACCATTCACTTGGGGC1160     CCCGGCGGGTCGTGGTGCTGTGCGGACATGATGCCGTCAA1200     GGAGGCTCTGGTGGACCAGGCTGAGGAGTTCAGCGGGCGA1240     GGCGAGCAGGCCACCTTCGACTGGCTCTTCAAAGGCTATG1280     GTGAGGGGGTGCCAAGAGGGGGAAGGTGGTCAGGTGGATG1320     CAATGGTCTCCGTGTCCCCAGCCTTCTCCCTGACTCTCCT1360     GCCCACTGGAGGATATGGGAGAGCCCCCTGTCTGGTCTTC1400     TCTCCCCATCTCCCTACATCGGGGACTCTCCCTGTGAGTC1440     CCACACCTGTCTCCAGCGCCCCTGGCGTGATTCCTCCCTG1480     CCTCTCTCTGCCCCGTCTCCTCCTTCTCTCTCACTGGAGT1520     CTCCTCTTACCCCTCTCTCTCCATCTCTGAGGACATCCGG1560     GGTTTCTGTTTACCAGCCCTGGTCCTCTGTCTTCATTTGT1600     CTTTTTGTCGCTCTCGGCTTCTGTGCTTCTCCGTGTTTCT1640     CCTCTCTCTGCTTCCCTCTCCCACTTCTTCCTCTGTCTTA1680     GGATTTCAGGGTATTCCTACTTCCACATCTCCAGCTCCCA1720     ACTCCTGGTAATTGTCTGTCCTCCTTCCCGATCCTCTCTG1760     TTTCTGTCTCCATATTTTTCTCTCTCTTCTCCAGTTCAGA1800     TTAAGAATCTTTCACCATTTTTATTTCCTCCTCCCAGATC1840     TCCCCATATCTCACTTCCCCTCCCTCCATCTCTCTCTTTC1880     TCTCCCCACTACCTTCCCTTCCTCCATGGAGTATCCCCGT1920     ATCCCTCTGTTTCTCTGCATCTGTCTGTCTGGCCTTTCTG1960     CTTCTCTTCTGATTCTCTTATTCTTTCTACCCGGACTCTC2000     TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC2040     TCTCTCTCTCTCTCTCTCGTGCTCTCGTGTTTCTCTGACT2080     GAGAGTGTAGCTCTCCTGGGCACTCGCGCTGAATCCATCT2120     CTCTCCCACACCACTCCCTCTCTACACCACCCTTGGGGAG2160     CCCCTTGGAACTGGTCCGCTCCTGCTACCACCACCCCCTG2200     ACCTCTCTCCACCCCCGCGTTCACCTCCCCAGGCGTGGCG2240     TTCAGCAACGGGGAGCGCGCCAAGCAGCTCCGGCGCTTCT2280     CCATCGCCACCCTAAGGGGTTTTGGCGTGGGCAAGCGCGG2320     CATCGAGGAACGCATCCAGGAGGAGGCGGGCTTCCTCATC2360     GACGCCCTCCGGGGCACGCACGGTGAGTAGGGGACCCCGA2400     GTGCGAGGGCGGGAACCCGCGCTTTCTGCCTGGGGATGGG2440     GACTAGGTGGGGAAAGGGGCCCGCACTTCCAGCCCTGGAG2480     TCTGGCGCTGGGATTCGGCTCAACAGGGCCCTGCCTCCTG2520     GAATTCTGACTCTCCTCAGACCTCTGAGTTGACTCTCTCC2560     CCAACCCCCCTTCTCCCGCCACACCTGTAGGCGCCAATAT2600     CGATCCCACCTTCTTCCTGAGCCGCACAGTCTCCAATGTC2640     ATCAGCTCCATTGTCTTTGGGGACCGCTTTGACTATGAGG2680     ACAAAGAGTTCCTGTCACTGTTGCGCATGATGCTGGGAAG2720     GTTCCAGTTCACGGGAACCTCCACGGGGCAGGTAACTGGA2760     TGCAGCCCGCCAGTGACGCCCCTACCACAACCTGCCAACT2800     GCTCCCCTACCTGGAGACAGGTGCCCCAAACTCCCACCCC2840     CCTCCAGACAGTGTCCCCTCAAAATCAGTCCCCGATATTG2880     GACAACTGGACGGTTGGACCAGAACCCAGGAGGGATGCCC2920     AATACCCAGTCTCCAAGGACACCTGGATAGCTCAACAGAT2960     GATCCCCAAAACAGAGCCTGCTGGCAGGATGCATACCCTC3000     AGCTCAGCTCTCTCACCTGGGCACATGTTCCCATCCCCAT3040     CTTACCGTAATTTGTAACAGGTGCTCCCTACCCAGTTCTT3080     CTGAATATTTAACACCTGGACAAGTGACTGTGTCAACCCG3120     CCTCCTGCATACCTGAACACCTGGTGCTGCAAAATCCAGG3160     CCATCATAATCATTTCACATCTACACAAATGTCACAGATT3200     AGCCCACTGGTAATATCTGGCCAAGGGCCCTCTACTTCAC3240     CCACTTAAATGCCTGAAAACATGGACAGGTGCCCTAACCA3280     ATACCCTAAACACATAAATATCTAGATAGATTATTCCCTG3320     ACACCCAAATAAGTGTTCCCCAACCCTTTCCAATCACACA3360     CCTTACAGAGGTGCTCCCAGTGCATCCCACTTGGATAGGT3400     AAACACCTCAACAGGTATCCCCTCCACTTCAGCATCTTCA3440     CCAGCCCCACTTTATACCTGAGCACCTGAACAAAAGCCCC3480     CAATCCAGACCCAGTAAGTATCTGGACAGCTGTCTCCAAC3520     CAAGTCCACTTGAATGCCTAAATACCTAGACAGGTGCCAC3560     TCACCTCATACCAGCCCCACCTGAAGAGCTAAACACCTGG3600     ACAGGTGTCTTCCAACTCAACTTCACTTGAATATCTGAAC3640     ACCTAGATGTGTGCTCCAATCCAGCCTCATTTGCATACCT3680     GAAACCTGGATATATGCCTCAGTTCTTCTCACCTAAATTA3720     CTAGACCGTGCCCCTGGCACCTAATCCACGTGAAAACTTA3760     GATATAAGTTTCCATCCAACCCCACTGAAATACCTAAACA3800     CCTGGACAGATGCCTTTAACTCCGTTCCTTCCTTGCTATG3840     AAACAAATCCCCATTCCCATCAGCTCCTGCCCCGTGACAG3880     CTGTCCTTCCCTTCCCATCCTCTCTCTGCAACCCCAGCTC3920     TATGAGATGTTCTCTTCGGTGATGAAACACCTGCCAGGAC3960     CACAGCAACAGGCCTTTAAGGAGCTGCAAGGGCTGGAGGA4000     CTTCATCGCCAAGAAGGTGGAGCACAACCAGCGCACGCTG4040     GATCCCAATTCCCCACGGGACTTCATCGACTCCTTTCTCA4080     TCCGCATGCAGGAGGTACATCCCAGCAGCCAGTGCAGGCA4120     GGTGCAAAGCCAGGGAGAGGGAAATCAGGATGGGAGTGGG4160     GTGGGCAGACGACACAGGCCCATTCAAATTAGCCCTCGTC4200     ATAATAATCCTTACAATTGGCCAGGCGCGGTGGCTCATGA4240     CCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGTGGAT4280     CACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATG4320     GTGAAACCCCGTCTCTACTAAAAATACAAAAATGAGCTAG4360     GTATGGTGGCATGCGCCTGTAATCCCAGCTACTCAGGAGG4400     CTGAGACAGAAGAATTTGTTTGAATCCGGGAGGCAGAGGT4440     TGCAGTGAGCCGGGATCATGCCACTGCACTCCGGCCTGAG4480     TGACAGAGCAAGACCCTGTAAAAAAAAAAAAAAAAAAAAA4520     AAAAAATTCCGGAAAACCCCAATTACATCACCCACTGCTG4560     TCCCATCTACTGAGCCCTCACCCACAAGGACGGGTTATGG4600     AGGTGGATTAGATTGGAAAGAACTTCTCAAGAACTACCGG4640     GTGCCAGGAACTGGGTTAAGTGTTTTATGATAGTCCGCCA4680     TGGAACACTTTTAACAGTTCTTGAGGGAGGTTCACTCATG4720     GCCCCAGTTGTACAAATGAGGAAACTGAGGCCCAGAGAGT4760     TTAAGTGTCTTAACTGAGGTCACAACAGTGAGGAAGACCA4800     TGGTCCCCCTAGCTCAAACCCTGGTCTCTCTGAGCCTATA4840     GCTGGTGCTTTTAGCCACCATGCTCTCTAACCGTTCATGT4880     CCTGGTTAGCAGACACACCTCTGTGGACAGGTGACCTGGC4920     TTTACATTGCAGGGTCCCCGCCTACCTCTGGATGTCAGCC4960     TCCCATGTGGGAAGGCTTTAGGAAGCCAAAGCTCAGGGAG5000     AAAGGATCAAGGGAGGGATTCCTCCACAGTAAGTTTCAAG5040     ATTTTTAGGGAAGAAATAGGATGCTGTTGCTTAAAATTCT5080     GTGCTTGTATCTCAGAAAAACTCTTTTTTTCTGACTCTTC5120     ATCTTGCCATCTCTGTACTACTTTCTCTTCGTCTCCCCTC5160     ATCCTTCTCTTTCCAAATATTCCTATCATTAAAAAAGTAA5200     CAGACTGGGAAACATGGCAAAACCCCGTCTGTACAAAAAA5240     ATGGCTAGGCATGGTGGTGCATGCCTGCGGTCCCAGCTAC5280     TAAGGAGGTTGAGGTGGGAGGATATCTTGAGCCCAGGGTG5320     GGCAGAGGTTTCAATGAGCCGATATCACAGCCCTGCCCTC5360     CAGCCTGGGTGACAGAATAAGACCGTGTCTCCCAAAAAAA5400     AAAAGAATTAATTTTTTAACAGTTAACAAGTGAGCCTGCA5440     TAGTCATGTGCATGTGCAGTTCCAGCTACTCTGGAGGCTG5480     AGACCGGAGGATTCCTTGAACCCAGGAGTTGGAGTCCAGC5520     CTGTGCAACTTAGCAAGACCAAGTCTGCATAAAAAAAAAA5560     AAAACCAACTGACAGCTAAGTTGACAATTAAAGGATAGAT5600     GATCAGTGAGGTAAAGAAGGTGAGAAGGAAGAGCATTTTG5640     GGCAAAGCCAGCAGCCAGGGCAAGGGCTGGAACCTGGAGC5680     GAGTTTGGCAAATCTAGGGTCCCTCTTTCCACCTTTGGTC5720     TGGACCAAAGAGAGGTAGCTCCAAAGGAAAAGCCCTAGAA5760     GGGCCCCAAGAGCATGGAGAGTGAGCTTGGTCTAAACCGC5800     CCTCTCCCTGCAGGAGGAGAAGAACCCCAACACAGAGTTC5840     TACTTGAAGAACCTGGTGATGACCACCCTGAACCTCTTCT5880     TTGCGGGCACTGAGACCGTGAGCACCACCCTGCGCTACGG5920     TTTCCTGCTGCTCATGAAGCACCCAGAGGTGGAGGGTAAG5960     ACTGGAAAGGGAGGAAAGTGAAGGGCCCCAGACCCTCAAA6000     ACTCCCCTGAGCCTGGTGCAGTGTACCCACCTATCCCAGA6040     TCCCAGGACCCTGAGACGTGCCTTGCTGTCCAGAGACAGG6080     ACAATATTCAGCTGATAGGCATCAGCTGAGTCTCATTAGC6120     TATTAAAATATTGAAAATGTCTGCACTGATTGGTCAGTCA6160     CTCCTGTCCCAAGCCCACTGAGTGTCCGCTGCCTGCTCCT6200     CTGGATCATCCCCTAAGTTCCTCCCTTGTCCTACCCTGTG6240     ATTCTGACACAACCTGGTTTAACAGGGATCCTGCTGCAAA6280     CAATGCGAATGGGTGATGTCTTGTTCTTGTTTATGAATGG6320     GCTTACCCTTCGTGTCAGAGGTGGAAGCTATGTCAACCGC6360     CGTGTTTTAGCTAGGGGGGGCGATACATGCCCTGCTCTAA6400     GACCCCTAGAGAGGGTAAAGATATTCCCCTCCTCCGCCAG6440     CCAAGGTCCATGAGGAGATTGACAGAGTGATCGGCAAGAA6480     CCGGCAGCCCAAGTTTGAGGACCGGGCCAAGATGCCCTAC6520     ACAGAGGCAGTGATCCACGAGATCCAAAGATTTGGAGACA6560     TGCTCCCCATGGGTTTGGCCCACAGGGTCAACAAGGACAC6600     CAAGTTTCGGGATTTCTTCCTCCCTAAGGTGCTGTCTCCC6640     CTCCACCACCACCACTCAGACTACGGGGACTTCCAGCCTC6680     TCTCTGTGTCCCCAGAATCCTGCCCCCATTAGTGTTCTAG6720     ACTCTGTCCCACTCCCTCAATCAGTCAAAAAAGACTTCCC6760     CAACCACCACATCTGTTCCACCTTTCCACTTAGACAGTCC6800     TGAGTCCTGCATCTCGCCAGACTCTTTGTGTCAGGAGAAT6840     ACACCCCATGTTCCCAATCTTCCTGTCTTAAGAAACAGAA6880     GCCCCCTTTCCATTAGGCCTTGTGGCTTAGGGACACAAAT6920     CTCAGGTCCCTCAAACACCCTGGCTAGTGGAACATGGACC6960     CCATGTCTCCCAAACTTCCTGTCTCAGAGACATGAAACTT7000     CTATCCCCCAAAGCTCCTCCCTCAGAGGTCCCCAACTCCT7040     CCATGTCGTGCCACTCCCCGCACCTGGGGGACCCTAGAGC7080     CCCCTGGAGCCCCTGTGTACTTTCACCAATCCCCCCAACC7120     TGGCTCATAACACACACCTTCCTCCTCCCTCCCAGGGCAC7160     TGAAGTGTTCCCTATGCTGGGCTCCGAGCTGAGAGACCCC7200     AGGTTCTTCTCCAACCCCCAGGACTGCAGTCCCCAGCACT7240     TCCTGGATGAGAAGGGGCAGTTTAAGAAGAGTGATGCTTT7280     TGTGCCCTTTTCCATCGGTAAGAGACACTGTTTGCTGCCA7320     GGCCACGGCTCACACCAGCAGGGGCCTCTCTCACCCACCT7360     CCCCTCTCTGCGGTGTAGCCTGGTATTTCTCCAGCTTGGA7400     AGTTCCTGTTAGAATCTACCATTGAGCCGCCACCAGCTGA7440     TACTCCCTTAACTGCCAAGCACCCAATACCTGCGCCCAGG7480     TAAAAGGGAAGGAAACATCTTCCCCCATAGATTTATTTGT7520     CTAGGGTCACACAGCAGATTCTTCAGCTCCCTGAAAAGGA7560     GATAATGGTACAGCACAGCAGTCATATTTGCAAGTGTATC7600     TGGGGGGTAGGGGCATCTAAACCTCCCATTGCTACACCTG7640     GCATGGATCACCCCATCTATGATGGAGGCATGACATTATG7680     CCTTTTTCGAAACCCATAGAACTGTATAACACAGAGTAAA7720     CCCTAATGTAAACTATGGACTTTGGTTAGTAATAATATAT7760     CAATATTGGTTCACCATTGTTATATCTCTTATAGAAGGAA7800     ACTGAAGCTCAGGGAGGATCGGAGTCTCCTCTGAAAGTCT7840     CTCAGGCCATAATATTCCCACCCCTCCTCCCTAGAGAGTG7880     CAGCCGGGGGTCAGTAGGGGTTGAGGCTGCACTGAGAGTG7920     GGCTTCACCTTCACCCCTCCTGCCTCTCCTCCTCAGGAAA7960     GCGGTACTGTTTTGGAGAAGGCCTGGCCAGAATGGAGCTC8000     TTTCTCTTCTTCACCACCATCATGCAGAACTTTCGCTTCA8040     AGTCCCCTCAGTCGCCTAAGGATATCGACGTGTCCCCCAA8080     ACACGTGGGCTTTGCCACGATCCCACGAAACTACACCATG8120     AGCTTCCTGCCCCGCTGAGCGAGGGCTGTGCTGGTGCAGG8160     GCTGGTGGGCGGGGCCAGGGAAACGGCCGGGGCAGGGGCG8200     GGGCTTGTGGGAGGGGCGGGGCTAAGAATGGGGGCAGTGG8240     GGGAAGGAAGGGGAGAGGTGGTTAGAGGGAACAGAAGAAA8280     CAGAAGGGGCTCAGTTCACCTTGATGATGTCCTTCAGAGC8320     TGTGATGAGAGGAAGGGAAACCTTACAGTATGCTACAAAG8360     AGTAGTAATAATAGCAGCTCTTATCTCCTGAACAAGTCCC8400     TCCCTGTCAGCTTTGTTCAAAAAGCGTTGCACGCTCACCT8440     CACTTATTTGCCACACACCTCTACCAATGGGGGAAAAGTC8480     TTCATTCCCCTTTTTACACGTGAGAAAGGTGCGGCTCAGA8520     AAGTTGTCTCTATCTGAAAACTCACAAAACGCAAGTGTCC8560     AGAGGATCTTGGAACACAGATCTGGGCCCATAGCCCTCTA8600     GATCGATCCTCACCATAGCACCCCTTCTTCACGTAAAATA8640     GCTTAGTATAGCATCACATGGCCTGAACACCCCTGGGCCG8680     GGGGGTTCCCCAGAGACCTGGCGGGCGGCTGCCCTGCCTA8720     CTCTGTACACTCGCCTACTCGGGACGATCCGGGCACCAGG8760     GTGTCACCTGAGCTCGCTA8779     (2) INFORMATION FOR SEQ ID NO:5:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: HF18     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2C9     mutation detection.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     TGCAAGTGCCTGTTTCAGCA20     (2) INFORMATION FOR SEQ ID NO:6:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 21 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: HF2R     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2C9     mutation detection.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     AGCCTTGGTTTTTCTCAACTC21     (2) INFORMATION FOR SEQ ID NO:7:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: J51     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     detection.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     GGCTTCCTCATCGACGCACT20     (2) INFORMATION FOR SEQ ID NO:8:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: J61     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     detection.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     GGCTTCCTCATCGACGCACA20     (2) INFORMATION FOR SEQ ID NO:9:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: Primer B     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     detection.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     AATTCCAGGAGGCAGGGCCT20     (2) INFORMATION FOR SEQ ID NO:10:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 57 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: F4 primer     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     genotyping.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCA40     GGATTCATGGTGGGGCA57     (2) INFORMATION FOR SEQ ID NO:11:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 31 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: fragment of F4     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     genotyping.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC31     (2) INFORMATION FOR SEQ ID NO:12:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 57 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: R4     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     genotyping.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTC40     TTAGCTGCGCCCCTCTC57     (2) INFORMATION FOR SEQ ID NO:13:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 31 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: fragment of R4     (B) LOCATION:     (D) OTHER INFORMATION: Primer used for CYP2A6     genotyping.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG31     (2) INFORMATION FOR SEQ ID NO:14:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 54 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: E3F     (B) LOCATION:     (D) OTHER INFORMATION: Primer used to     distinguish CYP2A6v1 from CYP2A6v2.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:     CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCG40     CCAAGCAGCTCCTG54     (2) INFORMATION FOR SEQ ID NO:15:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: fragment of E3F     (B) LOCATION:     (D) OTHER INFORMATION: Primer used to     distinguish CYP2A6v1 from CYP2A6v2.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:     GCGTGGTATTCAGCAACGGG20     (2) INFORMATION FOR SEQ ID NO:16:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 57 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: E3R     (B) LOCATION:     (D) OTHER INFORMATION: Primer used to     distinguish CYP2A6v1 from CYP2A6v2.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:     CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCC40     CCCGCACTCGGGATGCG57     (2) INFORMATION FOR SEQ ID NO:17:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 21 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: unknown     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: oligonucleotide     (ix) FEATURE:     (A) NAME/KEY: fragment of E3R     (B) LOCATION:     (D) OTHER INFORMATION: Primer used to     distinguish CYP2A6v1 from CYP2A6v2.     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:     TCGTCCTGGGTGTTTTCCTTC21     __________________________________________________________________________ 

We claim:
 1. A CYP2A6v2 DNA having a coding sequence shown in FIG. 9 (SEQ ID NO: 1).
 2. The DNA of claim 1 having a genomic sequence as shown in FIG. 10 (SEQ ID NO: 2).
 3. A CYP2A13 DNA having a coding sequence shown in FIG. 11 (SEQ ID NO: 3).
 4. The DNA of claim 3 having a genomic sequence shown in FIG. 12 (SEQ ID NO: 4).
 5. A nucleic acid primer sequence comprising at least ten (10) contiguous nucleotide bases selected from the sequence shown in FIG. 10 (SEQ ID NO: 3).
 6. A nucleic acid primer sequence comprising at least ten (10) contiguous nucleotide bases selected from the sequence shown in FIG. 12 (SEQ ID NO: 4).
 7. A nucleic acid primer sequence selected from the group consisting of:5' GGCTTCCTCATCGACGCACT 3' (SEQ ID NO: 7); 5' GGCTTCCTCATCGACGCACA 3' (SEQ ID NO: 8); 5' AATTCCAGGAGGCAGGGCCT 3' (SEQ ID NO: 9); 5' TGCAAGTGCCTGTTTCAGCA 3' (SEQ ID NO: 5); 5' AGCCTTGGTTTTTCTCAACTC 3' (SEQ ID NO: 6); 5' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCA GGATTCATGGTGGGGCA 3' (SEQ ID NO: 10); 5' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTC TTAGCTGCGCCCCTCTC 3' (SEQ ID NO: 12); 5' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCG CCAAGCAGCTCCTG 3' (SEQ ID NO: 14); 5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCC CCCGCACTCGGGATGCG 3' (SEQ ID NO: 16); or a fragment of said nucleic acid sequences of at least contiguous nucleotides.
 8. A method of determining the presence or absence of an allelic variant in CYP2A6 or CYP2C9 DNA by using the primers recited in claims 5 or 6 or 7 comprising:(a) amplifying an exon containing a variant sequence with in said DNA, producing an extension product; (b) treating extension products with at least one restriction endonuclease under conditions sufficient to produce digestion fragments; (c) analyzing the digestion fragments, for a variant specific digestion fragment or lack thereof.
 9. The method of claim 8 wherein a CYP2C9 variant DNA is being detected.
 10. The method of claim 9 wherein the amplifying step is a polymerase chain reaction using primers comprising HF18 (SEQ ID NO: 5) and HF2R (SEQ ID NO: 6).
 11. The method of claim 8 wherein step (a) is preceded by a gene-specific amplification reaction.
 12. The method of claim 11 wherein the gene-specific amplification is a polymerase chain reaction.
 13. The method of claim 12 wherein a CYP2A6 variant is being detected.
 14. The method of claim 13 wherein a gene-specific amplification reaction uses primers comprising F4 (SEQ ID NO: 10) and R4 (SEQ ID NO: 14) and the exon amplification reaction uses primers comprising E3F (SEQ ID NO: 14) and E3R (SEQ ID NO: 16).
 15. The method according to claim 10 wherein the extension products are treated with the restriction endonuclease AvaII.
 16. The method according to claim 14 wherein the extension products are treated with at least one restriction endonuclease comprising DdeI and XcmI.
 17. A method of determining the presence or absence of an allelic variant in CYP2A6 or CYP2C9 DNA by using the primers recited in claims 5 or 6 or 7 comprising:(a) contacting said DNA with a first primer encompassing a nucleotide variation specific to variant DNA and a second primer which is complementary to a region of said DNA such that upon hybridization and amplification, an extension product will be formed; (b) analyzing the extension products for allelic-variant specific extension products.
 18. The method of claim 17 wherein a CYP2A6 variant DNA is being detected.
 19. The method of claim 18 wherein the amplifying step is a polymerase chain reaction wherein the first primer comprises J51 (SEQ ID NO: 7) and J61 (SEQ ID NO: 8) and the second primer comprises primer B (SEQ ID NO: 9).
 20. A kit for determining the presence or absence of an allelic variant of CYP2A6 or CYP2C9 DNA comprising: at least one nucleic acid primer sequence capable of hybridizing to said DNA selected from teh group consisiting of the primer Seq. I.D. Nos. 3-10, 12, 14, and 16; the kit further containing instructions relating to the determination of the presence or absence of an allelic variant of CYP2A6 or CYP2C9 DNA.
 21. The kit according to claim 20 further comprising amplification components and at least one restriction endonuclease.
 22. The kit of claim 20 wherein the CYP2A6 allelic variant is being detected.
 23. The kit of claim 22 wherein the nucleic acid primers comprise F4 (SEQ ID NO: 10), R4 (SEQ ID NO: 12), E3F (SEQ ID NO: 14) and E3R (SEQ ID NO: 16).
 24. The kit according to claim 20 wherein the CYP2C9 allelic variant is being detected.
 25. The kit according to claim 24 wherein the nucleic acid primers comprise HF18 (SEQ ID NO: 5) and HF2R (SEQ ID NO: 6). 